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1 GACAGAGTGCAGCCTTTTCAGACTCTGTGACACAGTTCCCCTTTT 
4 6 GCAAAAATACTTAGCGAGGATCATTACTTTCCAACAGTCGTGTCC 
9 1 AGAGACCTACTTTGTAACACCGCAGGGAAGTTAATGTACTAGGTC 
136 TTGAAAGGTCTTTCTGGAATGTGC AGTAACTTGTAGTTTTCTTCT 
181 A6TAGCACTGCTAATTTTTGTGTTATAATTTTTGTAGGTCC ATGG 

226 G6CCGATGTATGGGAGATGAATGTGGTCCCGGAGGCATCCAAACG 

MetGlyAspGluCysGlyProGlyGlylleGlnThr 

271 AGGGCTGTGTGGTGTGCTCATGTGGAGGGATGGACTAC ACTGCAT 
ArgAlaValTrpCysAlaHisValGluGlyTrpThrThrLeuHis 

316 ACTAACTGTAAGC AGGCCGAGAGACCC AATAACC AGO AGAATTGT 
ThrAsnCysLysGlnAlaGluArgProAsnAsnGlnGlnAsnCys 

361 TTCAAAGTTTGCGATTGGCACAAAGAGTTGTACGACTGGAGACTG 
PheliysValCysAspTrpHisLysGluLeuTyrAspTrpArgLeu 

406 GGACCTTGGAATCAGTGTCAGCCCGTGATTTCAAAAAGCCTAGAG 
GlyProTrpAsnGlnCysGlnProVallleSerLysSerljeuGlu 

451 AAACCTCTTGAGTGCATTAAGGGGGAAGAAGGTATTCAGGTGAGG 
LysProIieuGluCysIleLysGlyGluGluGlylleGliiValArg 

496 GAGATAGCGTGCATCCAGAAAGACAAAGACATTCCTGCGGAGGAT 
GluIleAlaCysIleGlnLysAspLysAspIleProAlaGluAsp 

541 ATCATCTGTGAGTACTTTGAGCCCAAGCCTCTCCTGGAGC AGGCT 
IlelleCysGluTyrPheGluProLysProLeuLeuGluGlnAla 

586 TGCCTCATTCCTTGCCAGC AAGATTGCATCGTGTCTGAATTTTCT 
CysLeuIleProCysGlnGlnAspCysIleValSerGluPheSer 

631 GCCTGGTCCGAATGCTCCAAGACCTGCGGCAGCGGGCTCCAGCAC 
AlaTrpSerGluCysSerLysThrCysGlySerGlylieuGlnHis 

676 CGGACGCGTCATGTGGTGGCGCCCCCGCAGTTCGGAGGCTCTGGC 
ArgThrArgHisValValAlaProProGlnPheGlyGlySerGly 

721 TGTCCAAACCTGACGGAGTTCCAGGTGTGCCAATCCAGTCCATGC 
CysProAsnLeuThrGluPheGlnValCysGlnSerSerProCys 

766 GAGGCCGAGGAGCTCAGGTACAGCCTGCATGTGGGGCCCTGGAGC 
GluAlaGluGluLeuArgTyrSerLeuHisValGlyProTirpSer 

811 ACCTGCTCAATGCCCCACTCCCGACAAGTAAGACAAGCAAGGAGA 
ThrCysSerMetProHisSerArgGlnValArgGlnAlaArgArg 
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856 CGCGGGAAGAATAAAGAACGGGAAAAGGACCGC AGC AAAGGAGTA 
ArgGlyliysAsnLysGlxiArgGluLysAspArgSerLysGlyVal 

901 AAGGATCCAGAAGCCCGCGAGCTTATTAAGAAAAAGAGAAAC AGA 
LysAspProGluAlaArgGluLeuIleLysIiysLysArgAsnArg 

946 AAC AGGC AGAACAGAC AAGAGAACAAATATTGGGACATCC AGATT 
AsiiArgGlnAsnAr gGlnGluAsnLy sTyrTrpAspI leGlnl le 

991 GGATATCAGACC AGAGAGGTTATGTGCATTAACAAGACGGGGAAA 
GlyTyrGlnThrArgGluValMetCysIleAsnLysThrGlyliys 

1036 GCTGCTGATTTAAGCTTTTGCCAGC AAGAGAAGCTTCC AATGACC 
AlaAlaAspLeuSerPheCysGlnGlnGluLysIjeuProMetTlir 

1081 TTCCAGTCCTGTGTGATCACCAAAGAGTGCCAGGTTTCCGAGTGG 
PheGlnSerCysVallleThrLysGluCysGlnValSerGluTrp 

112 6 TCAGAGTGGAGCCCCTGCTCAAAAACATGCCATGACATGGTGTCC 
SerGluTrpSerProCysSerliysThrCysHisAspMetValSer 

1171 CCTGCAGGC ACTCGTGTAAGGACACGAACC ATCAGGCAGTTTCCC 
ProAlaGlyThrArgValArgThrArgThrlleArgGlnPhePro 

1216 ATTGGCAGTGAAAAGGAGTGTCCAGAATTTGAAGAAAAAGAACCC 
IleGlySerGluIiysGluCysProGluPheGluGluIjysGluPro 

1261 TGTTTGTCTCAAGGAGATGGAGTTGTCCCCTGTGCCACGTATGGC 
CysIieuSerGlnGlyAspGlyValValProCysAlaThrTyrGly 

1306 TGGAGAACTACAGAGTGGACTGAGTGCCGTGTGGACCCTTTGCTC 
TrpArgThrThrGluTrpThrGluCysArgValAspProLeuLeu 

1351 AGTCAGCAGGACAAGAGGCGCGGCAACCAGACGGCCCTCTGTGGA 
SerGlnGlnAspLysArgArgGlyAsnGlnThrAlaLeuCysGly 

1396 GGGGGCATCCAGACCCGAGAGGTGTACTGCGTGCAGGCCAACGAA 
GlyGlylleGlnThrArgGluValTyrCysValGlnAlaAsnGlu 

1441 AACCTCCTCTCACAATTAAGTACCCACAAGAACAAAGAAGCCTCA 
AsiUjeuLeuSerGlnlieuSerThrHisIfysAsnliysGluAlaSer 

1486 AAGCCAATGGACTTAAAATTATGCACTGGACCTATCCCTAATACT 
LysProMetAspIieuLysLeuCysThrGlyProIleProAsnThr 

1531 AC ACAGCTGTGCCACATTCCTTGTCC AACTGAATGTGAAGTTTC A 
ThrGlnljeuCysHisIleProCysProThrGluCysGluValSer 
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1576 CCTTGGTC AGCTTGGGGACCTTGTACTTATGAAAACTGTAATGAT 
ProTrpSerAlaTrpGlyProCysThrTyrGltiAsnCysAsnAsp 

1621 CAGCAA6GGAAAAAAGGCTTCAAACTGAGGAAGC66CGCATTACC 
GlnGlnGlyliysLysGlyPheliysIjeuArgliysArgArglleThr 

1666 AATGAGCCCACTGGAGGCTCTGGGGTAACCGGAAACTGCCCTCAC 
AsnGluProThrGlyGlySerGlyValThrGlyAsnCysProHis 

1711 TTACTGGAAGCCATTCCCTGTGAAGAGCCTGCCTGTTATGACTGG 
LeuIieuGluAl al 1 e Pr oCy sGluGluPro Al aCy sTy r AspTrp 

1756 AAAGCGGTGA6ACTGGGAGACTGCGAGCCAGATAACGGAAAGGA6 
I<ysAlaValArgI<eu61yAspCysGluProAspAsn6lyIjysGlu 

1801 TGTGGTCCAG6C AC6CAAGTTC AAGAG6TTGT6TGC ATC AACAGT 
CysGlyProGlyThrGlnValGlnGluValValCysIleAsnSer 

1846 GATGGAGAAGAAGTTGACA6AC A6CTGTGC A6A6ATGCC ATCTTC 
AspGlyGlu61uValAspArg61nl>euCysArgAspAlaIlePhe 

1891 CCCATCCCTGTGGCCTGTGATGCCCCATGCCCGAAAGACTGTGTG 
Pr ol 1 e ProVa 1 Al aCy s AspAl aPr oCy s Pr oLy s AspCy s Va 1 

1936 CTCAGCACATGGTCTACGTGGTCCTCCTGCTCACAC ACCTGCTCA 
LeuSerThrTrpSerThrTrpSerSerCysSerHisThrCysSer 

1981 GGGAAAACGACA6AAGGGAAAC A6ATAC6AGCAC6ATCCATTCT6 
GlyLysTlirThrGluGlyliysGliiIleArgAlaArgSerllelieu 

2026 GCCTATGCGGGTGAAGAA66T66AATTC6CTGTCCAAATAGCAGT 
AlaTyrAlaGlyGluGluGlyGlylleArgCysProAsnSerSer 

2071 GCTTTGCAA6AA6TAC6AAGCTGTAAT6AGCATCCTTGCACAGT6 
AlaI<eu6lnGluValArgSerCysAsn6luHisProCysThrVal 

2116 TACCACTGGCAAACT66TCCCT6GGGCCA6T6CATT6A66ACACC 
TyrHisTrpGlnThrGlyProTrpGlyGlnCysIleGluAspThr 

2161 TCA6TATC6TCCTTCAACACAACTACGACTTGGAATG6GGAGGCC 
SerValSerSerPheAsziThrThzrTlirTlirTrpAsnGlyGluAla 

2206 TCCTGCTCTGTCGGCATGCAGACAAGAAAAGTC ATCTGT6TGCGA 
SerCy s SerValGlyMetGlnThr ArgLy s Val 1 1 eCy s ValArg 

2251 6TCAAT6TGGGCC AAGTGGGACCCAAAAAAT6TCCTGAAA6CCTT 
ValAsnValGly61nVal61yProI<ysIjysCysPro6luSen<eu 
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2296 CGACCTGAAACTGTAAGGCCTTGTCTGCTTCCTTGTAAGAAGGAC 
ArgProGluThrValArgProCysLeuLeuProCysLysLysAsp 

2341 TGTATTGTGACCCCATATAGTGACTGGACATCATGCCCCTCTTCG 
CysIleValThrProTyrSerAspTrpThrSerCysProSerSer 

2386 TGTAAAGAAGGGGACTCCAGTATCAGGAAGCAGTCTAGGCATCGG 
CysLysGluGlyAspSerSerlleArgLysGlnSerArgHisArg 

2431 GTCATC ATTCAGCTGCC AGCC AACGGGGGCCGAGACTGCAC AGAT 
ValllelleGlnljeuProAlaAsnGlyGlyArgAspCysThrAsp 

2476 CCCCTCTATGAAGAGAAGGCCTGTGAGGCACCTC AAGCGTGCCAA 
ProIieuTyrGluGluLysAlaCysGluLAlaProGlnAlaCysGln 

2521 AGCTACAGGTGGAAGACTCACAAATGGCGC AGATGCC AATTAGTC 
SerTyrArgTrpIjysThrHisLysTrpArgArgCysGlnljeuVal 

2566 CCTTGGAGCGTGC AACAAGACAGCCCTGGAGCACAGGAAGGCTGT 
ProTrpSerValGlnGlnAspSerProGlyAlaGlnGluGlyCys 

2611 GGGCCTGGGCGACAGGCAAGAGCCATTACTTGTCGCAAGCAAGAT 
GlyProGlyArgGlnAlaArgAlalleThrCysArgLysGlnAsp 

2656 GGAGGAC AGGCTGGAATCC ATGAGTGCCTACAGTATGCAGGCCCT 
GlyGlyGlnAlaGlylleHisGluCysLeuGlnTyrAlaGlyPro 

2701 GTGCCAGCCCTTACCCAGGCCTGCCAGATCCCCTGCCAGGAT6AC 
ValProAlalieuThrGlnAlaCysGlnlleProCysGlnAspAsp 

2746 TGTCAATT6ACCAGCTGGTCCAAGTTTTCTTCATGCAATGGAGAC 
CysGlnljeuThrSerTrpSerLysPlieSerSerCysAsnGlyAsp 

2791 TGTGGTGCAGTTAGGACCAGAAAGCGCACTCTTGTTGGIUUU^GT 
Cy sGlyAl aVa 1 Ar gThr Ar gLy s Ar gThrlieuValGly Ly s S er 

2836 AAAAAGAAGGAAAAAT6TAAAAATTCCC ATTTGTATCCCCTGATT 
LysLysIiysGluLysCysLysAsnSerHisIieuTyrProLeuIle 

2881 GAGACTCAGTATTGTCCTTGTGACAAATATAATGCACAACCTGTG 
GluThrGlnTyrCysProCysAspLysTyrAsnAlaGlnProVal 

2926 GGGAACTGGTCAGACTGTATTTTACC AGAGGGAAAAGTGGAAGTG 
GlyAsnTrpSerAspCysIleLeuProGluGlyLysValGluVal 

2971 TTGCTGGGAATGAAAGTAC AAGGAGAC ATC AAGGAATGCGGACAA 
LeuLeuGlyMetLysValGlnGlyAspIleLysGluCysGlyGln 
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3016 GGATATCGTTACCAAGCAATGGCATGCTACGATCAAAATGGCAGG 
GlyTyrArgTyrGlnAlaMetAlaCysTyrAspGlnAsnGlyArg 

3061 CTTGTGGAAACATCTAGATGTAACAGCCATGGTTACATTGAGGAG 
LeuValGluThrSerArgCysAsnSerHisGlyTyrlleGluGlu 

3106 GCCTGCATCATCCCCTGCCCCTCAGACTGCAAGCTCAGTGAGTGG 
AlaCysIlelleProCysProSerAspCysLysLeuSerGluTrp 

3151 TCCAACTGGTCGCGCT6C AGCAAGTCCTGTGGGAGTGGTGTGAAG 
SerAsnTrpSerArgCysSerLysSerCysGlySerGlyValLys 

3196 GTTCGTTCTAAATGGCTGCGTGAAAAACCATATAATGGAGGAAGG 
ValArgSerLysTrpLeuArgGluLysProTyrAsnGlyGlyArg 

3241 CCTTGCCCC AAACTGGACCATGTCAACC AGGCAC AGGTGTATGAG 
ProCysProLysLeuAspHisValAsnGlnAlaGlnValTyrGlu 

3286 GTTGTCCCATGCCACAGTGACTGCAACCAGTACCTATGGGTCACA 
ValValProCysHisSerAspCysAsnGlnTyrljeuTrpValThr 

3331 GAGCCCT6GAGCATCTGCAAGGTGACCTTTGTGAATATGCGGGAG 
GluProTrpSerlleCysLysValThrPheValAsnMetArgGlu 

3376 AACTGTGGAGAGGGCGTGCAAACCCGAAAAGTGAGATGCATGCAG 
AsnCysGlyGluGlyValGlnThrArgLysValArgCysMetGln 

3421 AATACAGCAGATGGCCCTTCTGAACATGTAGAGGATTACCTCTGT 
AsnThrAlaAspGlyProSerGluHisValGliiAspTyrLeuCys 

3466 GACCCAGAA6AGATGCCCCTGGGCTCTAGAGTGTGCAAATTACCA 
AspProGluGlxiMetPx-oLeuGlySerArgValCysLysLeuPro 

3511 TGCCCTGAGGACTGTGTGATATCTGAATGGGGTCCATGGACCCAA 
CysProGluAspCysVallleSerGluTrpGlyProTrpThrGln 

3556 TGTGTTTTGCCTTGCAATCAAAGC AGTTTCCGGCAAAGGTCAGCT 
CysVallieuProCysAsnGlnSerSerPheArgGlnArgSerAla 

3601 GATCCC ATC AGACAACCAGCTGATGAAGGAAGATCTTGCCCTAAT 
AspProIleArgGlnProAlaAspGluGlyArgSerCysProAsn 

3646 GCTGTTGA6AAAGAACCCTGTAACCT6AACAAAAACTGCTACCAC 
AlaValGluIiysGluProCysAsnLeuAsnliysAsnCysTyrHis 

3691 TATGATTATAATGTAACAGACTGGAGTACATGTCAGCTGAGT6AG 
TyrAspTyrAsnValThrAspTrpSerThrCysGlnLeuSerGlu 
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3736 AAGGC AGTTTGTGGAAATGGAATAAAAAC AAGGATGTTGGATTGT 
LysAlaValCysGlyAsnGlylleLysThrArgMetLeuAspCys 

3781 GTTCGAAGTGATGGC AAGTC AGTTGACCTGAAATATTGTGAAGCG 
ValArgSerAspGlyLysSerValAspLeuLysTyrCysGluAla 

3826 CTT6GCTTGGAGAAGAACTGGCAGATGAACACGTCCTGC ATGGTG 
LeuGlyLeuGluIjysAsnTrpGlnMetAsnThzrSerCysMetVal 

3871 GAATGCCCTGTGAACTGTC AGCTTTCTGATTGGTCTCCTTGGTCA 
GluCysProValAsnCysGlnLeuSerAspTrpSerProTrpSer 

3916 GAATGTTCTCAAACATGTGGCCTCACAGGAAAAATGATCCGAAGA 
GluCysSerGlnThrCysGlyLeuThrGlyliysMetlleArgArg 

3961 CGAAC AGTGACCC AGCCCTTTCAAGGTGATGGAAGACCATGCCCT 
ArgThrValThrGlnProPheGlnGlyAspGlyArgProCysPro 

4006 TCCCTGATGGACCAGTCCAAACCCTGCCC AGTGAAGCCTTGTTAT 
SerljeuMetAspGlnSerljysProCysProValliysProCysTyr 

4051 CGGTGGCAATATGGCCAGTGGTCTCC ATGCCAAGTGC AGGAGGCC 
ArgTrpGlnTyrGlyGlnTrpSerProCysGlnValGlnGluAla 

4096 C AGTGTGGAGAAGGGACC AGAAC AAGGAACATTTCTTGTGTAGTA 
GlnCysGlyGluGlyThrArgThrArgAsnlleSerCysValVal 

4141 AGTGATGGGTCAGCTGATGATTTCAGCAAAGTGGTGGATGAGGAA 
SerAspGlySerAlaAspAspPheSerLysValValAspGluGlu 

4186 TTCTGTGCTGACATTGAACTCATTATAGATG6TAATAAAAATATG 
PheCy s Al aAspX 1 eGluLeuIl ell e AspGlyAsnLy s AsnMe t 

4231 GTTCTGGAGGAATCCTGCAGCCAGCCTTGCCCAGGTGACTGTTAT 
ValLeuGluGluSerCysSerGlnProCysProGlyAspCysTyr 

4276 TTGAAGGACTGGTCTTCCTGGAGCCTGTGTCAGCT6ACCTGTGTG 
LeuLysAspTrpSerSerTirpSerLeuCysGlnLeuThrCysVal 

4321 AATGGTGAGGATCTAGGCTTTGGTGGAATACAGGTC AGATCCAGA 
AsnGlyGluAspIieuGlyPheGlyGlylleGlnValArgSerArg 

4366 CCGGTGATTATAC AAGAACTAGAGAATCAGCATCTGTGCCCAGAG 
ProValllelleGlnGluLeuGluAsnGlnHisLeuCysProGlu 

4411 CAGAT6TTAGAAACAAAATC ATGTTATGATG6ACAGT6CTATGAA 
GlnMetLeuGluThrLysSerCysTyrAspGlyGlnCysTyrOlu 

4456 TATAAATG6ATGGCCAGTGCTTGGAA6GGCTCTTCCC6AACAGT6 
TyrI<ysTrpMetAlaSerAlaTrpIiys61ySerSerArgThrVal 
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4501 TGGTGTC AAAGGTC AGATGGTATAAATGTAAC AGGGGGCTGCTTG 
TocpCysGlnArgSerAspGlylleAsnValThrGlyGlyCysLeu 

4546 GTGAT6AGCC AGCCTGATGCCGAC AGGTCTTGTAACCC ACCGTGT 
ValMetSerGlnProAspAlaAspArgSerCysAsnProProCys 

4591 AGTC AACCCCACTCGTACTGTAGCGAGACAAAAACATGCCATTGT 
SerGlnProHisSerTyrCysSerGluThrLysThrCysHisCys 

4636 GAAGAAGGGTACACTGAAGTCATGTCTTCTAACAGC ACCCTTGAG 
GluGluGlyTyrThrGluValMetSerSerAsnSerThrLeuGlu 

4681 CAATGCACACTTATCCCCGTGGTGGTATTACCCACCATGGAGGAC 
GlnCysThrlieuIleProValValVaHjeuProThrMetGliiAsp 

4726 AA2VAGAGGAGATGTGAAAACC AGTCGGGCTGTAC ATCC AACCC AA 
LysArgGlyAspValLysThrSerArgAlaValHisProThrGln 

4771 CCCTCCAGTAACCC AGO AGGACGGGGAAGGACCTGGTTTCTAC AG 
ProSerSerAsnProAlaGlyArgGlyArgThrTrpPheLeuGln 

4816 CC ATTTGGGCCAGATGGGAGACTAAAGACCTGGGTTTACGGTGTA 
ProPheGlyProAspGlyArgLeuLysThrTrpValTyrGlyVal 

4861 GCAGCTGGGGCATTTGTGTTACTCATCTTTATTGTCTCCATGATT 
AlaAlaGlyAlaPheValLeuLeuIlePhelleValSerMetlle 

4906 TATCTAGCTTGCAAAAAGCCAAAGAAACCCCAAAGAAGGCAAAAC 
TyrLeiiAlaCysLysLysProLysLysProGlnArgArgGlriAsn 

4951 AACCGACTGAAACCTTTAACCTTAGCCTATGATGGAGATGCCGAC 
AsnArgLeuLysProLeuThrljeiiAlaTyrAspGlyAspAlaAsp 

4996 ATGTAACATATAACTTTTCCTGGCAACAACCAGTTTCGGCTTTCT 
Met 
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5041 GACTTCATAGATGTCCAGAGGCCACAACAAATGTATCCAAACTGT 

5086 GTGGATTAAAATATATTTTAATTTTTAAAAATGGCATCATAAAGA 

5131 CAAGAGTGAAAATCATACTGCC ACTGGAGATATTTAAGAC AGTAC 

517 6 CACTTATATACAGACCATCAACCGTGAGAATTATAGGAGATTTAG 

5221 CTGAATAC ATGCTGC ATTCTGAAAGTTTTATGTCATCTTTTCTGA 

5266 AATCTACCGACTGAAAAACCACTTTCATCTCTAAAAAATAATGGT 

5311 GGAATTGGCCAGTTAGGATGCCTGATACAAGACCGTCTGCAGTGT 

5356 TAATCCATAAAACTTCCTAGCATGAAGAGTTTCTACCAAGATCTC 

5401 CAC AATACTATG6TC AAATTAAC ATGTGTACTC AGTTGAATGAC A 

5446 C AC ATTATGTC AGATTATGTACTTGCTAATAAGC AATTTTAAC AA 

5491 TGC ATAACAAATAAACTCTAAGCTAAGCAGAAAATCC ACTGAATA 

5536 AATTC AGCATCTTGGTGGTCGATGGTAGATTTTATTGACCTGCAT 

5581 TTCAGAGAC AAAGCCTCTTTTTTAAGACTTCTTGTCTCTCTCCAA 

5626 AGTAAGAATGCTGGACAAGTACTAGTGTCTTAGAAGAACGAGTCC 

5671 TC AAGTTCAGTATTTTATAGTGGTAATTGTCTGGAAAACTAATTT 

5716 ACTTGTGTTAATACAATACGTTTCTACTTTCCCTGATTTTCAAAC 

5761 TGGTTGCCTGC ATCTTTTTTGCTATATGGAAGGC AC ATTTTTGC A 

5806 CTATATTA6TGCAGC ACGATAGGCGCTTAACCAGTATTGCC ATAG 

5851 AAACTGCCTCTTTTCATGTGGGATGAAGACATCTGTGCCAAGAGT 

5896 GGCATGAAGACATTTGCAAGTTCTTGTATCCTGAAGAGAGTAAAG 

5941 TTCAGTTTGGATGGCAGC AAGATGAAATC AGCTATTACACCTGCT 

5986 GTACACACACTTCCTCATC ACTGCAGCCATTGTGAAATTGACAAC 

6031 ATGGCGGTAATTTAAGTGTTGAAGTCCCTAACCCCTTAACCCTCT 

6076 AAAAGGTGGATTCCTCTAGTTGGTTTGTAATTGTTCTTTGAAGGC 

6121 TGTTTATGACTAGATTTTTATATTTGTTATCTTTGTTAAGAAAAA 

6166 AAAAAGAAAAAGGAACTGGATGTCTTTTTAATTTTGAGCAGATGG 

6211 AGAAAATAAATAATGTATCAATGACCTTTGTAACTAAA66AAAAA 

6256 AAAAAAAAATGTGGATTTTCCTTTCTCTCTGATTTCCC AGTTTCA 

6301 GATTGAATGTCTGTCTTGCAGGCAGTTATTTCAAAATCCATAGTC 

6346 TTTNGCCTTTCTCACTGGCAAAATTTGA 
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CACCCCTCTGCCTGCCCCAGCCCGCCCATCGCTTCCCCTTTGGAG 
CCTCCT6CT66GCCACTGGCTGGGATCAGGACACCAGTGAT6GTA 
AGTGCTGGCCCAGACTGAA6CTCGGA6AG6CACTCTGCTTGCCCA 
GCGTCACAGTCTTAGCTCCCAACTGTCCTGGCTTCCAGTCTCCCT 
TGCTTCCCAGATCCCAGACTCTAGCCCCAGCCCCGTCTCTTTCAC 
CAGCTCCTG6GACCCTACGCAATCTGCGCCTGCGTCTCATCAGTC 
GCCCCACATGTAACTGTATCTACAACCAGCTGCACCAGCGACACC 
TGTCCAACCCGGCCCGGCCTG6GATGCTAT6TGG6G6CCCCCAGC 
CTGGGGT6CA6GGCCCCT6TCAGGTCTGATAGGGAGAAGA6AAGG 
AGCAGAAGG6GAGGGGCCTAACCCTGGGCTGGGGGTTGGACTCAC 
AGGACTGGGG6AAAGAGCTGCAATCAGAGGGTGTCT6CCATAGCT 
6G6CTCA6GCATCTGTCCTTGGCTTT6TTGCCTGGCTCCAGG6AG 
ATTCCGGGGGCCCTGTGCTGTGCCTCGAGCCTGACGGACACTGGG 
TTCAGGCTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGG 
ACGCTCCTGTGCTGCTGACCAACACAGCTGCTCACAGTTCCTGGC 
T6CA6GCTC6A6TTCA66666CAGCTTTCCT6GCCCA6A6CCCA6 

A6ACCCCGGA6ATGAGTGAT6AGGACAGCTGT6TA6CCTGTGGAT 
MetSerAspGluAspSerCysValAlaCysGlyS 

CCTT6AG6ACAGCAGGTCCCCA6GCAGGA6CACCCTCCCCATGGC 
erLeuArgThrAlaGlyProGlnAlaGlyAlaProSerProTrpP 

CCTGGGAGGCCA6GCTGATGCACCAGGGACAGCTGGCCTGTGGCG 
roTz:p6luAlaA3rgI<euMet:HisGlnGly61nI<evjAlaCys61y6 

GAGCCCTG6TGTCAGAGGA6GCGGTGCTAACTGCTGCCCACTGCT 
lyAlaLeuValSerGluGluAlaValLeuThrAlaAlaHisCysP 

TCAATGGGCGCCAGGCCCCAGAGGAAT6GAGCGTAGG6CT6G66A 
heAsnGlyArgGlxiAlaProGluGluTirpSerValGlyljeuGlyT 

CCAGACC66AGGAGTGGGGCCTGAA6CAGCTCATCCTGCATGGA6 
hz-ArgProGluGluTrpGlyljeuLysGlnlieuIleLieuHisGlyA 

CCTACACCCACCCTGAGGGGGGCTACGACATGGCCCTCCTGCTGC 
laTyrThrHi sProGluGlyGlyTyr AspMe t Al aLeuLeuLeuIj 

TGGCTCAGCCTGTGACACTGGGAGCCAGCCTGCGGGCCCTCTGCC 
euAlaGlnProValThrlieuGlyAlaSerLeuArgAlaLeuCysIi 

TGCCCTATTTTGACCACCACCTGCCTGATGGGGAGCGTGGCTGGG 
euProTyr PheAspHi sHi sLeuProAspGlyGltiArgGlyTrpV 

TTCTGGGACGGGCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGA 
allieuGlyArgAlEUlrgProGlyAlaGlylleSerSerLeuGInT 

CAGTGCCCGTGACCCTCCTGGGGCCTAGGGCCTGCAGCCGGCTGC 
hrValProValThrLeuljeuGlyProArgAlaCysSerArgLeuH 
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1216 ATGCAGCTCCTGGG6GTGATGGCAGCCCTATTCTGCCGGGGATGG 
isAlaAlaProGlyGlyAspGlySerProIleLeuProGlyMetV 

i2 6 1 TGTGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGGGCCTGT 
alCysThrSerAlaValGlyGluLieuProSerCysGluGlylieuS 

1306 CTGGGGCACCACTGGTGC ATGAGGTGAGGGGC AC ATGGTTCCTGG 
erGlyAlaProIieuValHisGluValArgGlyThrTrpPheljeuA 

1351 CCGGGCTGCACAGCTTCGGAGATGCTTGCC AAGGCCCCGCCAGGC 
laGlyLeuHisSerPheGlyAspAlaCysGlxiGlyProAlaArgP 

1396 CGGCGGTCTTC ACCGCGCTCCCTGCCTATGAGGACTGGGTC AGC A 
roAlaValPheThrAlaLeuProAlaTyrGluAspTrpValSerS 

1441 GTTTGGACTGGCAGGTCTACTTCGCCGAGGAACCAGAGCCCGAGG 
erLeuAspTrpGlnValTyrPheAlaGluGluProGluProGluA 

1486 CTGAGCCTGGAAGCTGCCTGGCCAACATAAGCCAACCAACCAGCT 
laGluProGlySerCysIjeuAlaAsnXleSerGlnProThrSerC 

1531 GCTGACAGGGGACCTGGCC ATTCTCAGGACAAGAGAATGCAGGCA 

ys 

1576 GGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGT 

1621 GATTCCAGGC ACC AGGGC AGGCCC AGAAGCCC AGC AGCTGTGGGA 

1666 AGGAACCTGCCTGGGGCCACAGGTGCCCCCTCCCC ACCCTGCAGG 

1711 AC AGGGGTGTCTGTGGAC ACTCCC AC ACCC AACTCTGCTACC AAG 

1756 CAGGCGTCTCAGCTTTCCTCCTCCTTTACCCTTTCAGATACAATC 

1801 ACGCC AGCCCCGTTGTTTTGAAAATTTCTTTTTTTGGGGGGCAGC 

1846 AGTTTTCCTTTTTTTAAACTTAAATAAATTGTTAC AAAATAGACT 

1891 TTAG 



Fig. 2 (continued) 
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1 GCGGATCCTCACACGACTGTGATCCGATTCTTTCCAGCGGCTTCT 

4 6 GCAACCAAGCGGGTCTTACCCCCGGTCCTCCGCGTCTCCAGTCCT 

9 1 CGCACCTGGAACCCCAACGTCCCCGAGAGTCCCCGAATCCCCGCT 

136 CCCAGGCTACCTAAGAGGATGAGCGGTGCTCCGACGGCCGGGGCA 



181 GCCCTGATGCTCTGCGCCGCCACCGCCGTGCTACTGAGCGCTCAG 
AlalieuMetLeuCysAlaAlaThrAlaVallieuLeuSerAlaGln 

226 GGCGGACCCGTGCAGTCCAAGTCGCCGCGCTTTGCGTCCTG6GAC 
GlyGlyProValGlnSerLysSerProArgPheAlaSerTrpAsp 

271 GAGATGAATGTCCTGGCGC ACGGACTCCTGC AGCTCGGCC AGGGG 
GluMetAsnValliexiAlaHisGlyLeuIjeuGlnljeuGlyGlnGly 

316 TGCGCGAAC ACCGGAGCGCACCCGCAGTCAGCTGAGCGCGCTGGA 
CysAlaAsnThrGlyAlaHisProGlnSerAlaGlxiArgAlaGly 

361 GCGCGCCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGG 
AlaArgLeuSerAlaCysGlySerAlaCysGlnGlyThrGluGly 

406 TCCACCGACCTCCCGTTAGCCCCTGAGAGCCGGGTGGACCCTGAG 
SerThrAspIieuProIieuAlaProGluSerArgValAspProGlu 

451 6TCCTTCACAGCCT6CA6ACACAACTCAAGGCTC AGAACAGCAGG 
ValljeuHxsSerljeuGlnThrGlnljeuIiysAlaGlxiAsnSerArg 

496 ATCCAGCAACTCTTCCACAAGGTGGCCCAGCAGCAGCGGCACCTG 
IleGlnGlnlieuPheHisIjysValAlaGlnGlnGlnArgHisIjeu 

541 GAGAAGCAGCACCTGCGAATTCAGCATCTGCAAAGCCAGTTTGGC 
GluLysGlnHisLeuArgXleGlnHisLeuGlnSerGlnPlieGly 

586 CTCCTGGACCACAAGCACCTAGACCATGAGGTGGCCAAGCCTGCC 
lieuLeuAspHisIjysHisLeuAspHisGluValAlaLysProAla 

631 CGAAGAAAGAGGCTGCCCGAGATGGCCCAGCCAGTTGACCCGGCT 
ArgArgliysArgljeuProGluMetAlaGlnProValAspProAla 

676 C AC AATGTC AGCCGCCTGCACCGGCTGCCC AGGGATTGCC AGGAG 
HisAsnValSerArglieuHisArgljeuProArgAspCysGlnGlu 

721 CTGTTCCAGGTTGGGGAGAGGCAGAGTGGACTATTTGAAATCCAG 
IieuPheGlnValGlyGltiArgGlnSerGlyLeuPheGluXleGln 

766 CCTCAGGGGTCTCCGCCATTTTTGGTGAACTGCAAGATGACCTCA 
ProGlnGlySerProProPheLeuValAsziCysIiysMetThrSer 



MetSerGlyAlaProThrAlaGlyAla 
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811 GATGGAGGCTGGACAGTAATTC AGAGGCGCC ACGATGGCTCAGTG 
AspGlyGlyTrpThrVallleGlxiArgArgHisAspGlySerVal 

856 GACTTC AACCGGCCCTGGGAAGCCTACAAGGCGGGGTTTGGGGAT 
AspPheAsxiArgProTrpGluAlaTyrLysAlaGlyPheGlyAsp 

901 CCCCACGGCGAGTTCTGGCTGGGTCTGGAGAAGGTGCATAGCATG 
ProHlsGlyGluPheTrpIieuGlyljeuGluIjysValHisSerMet 

946 ATG6GGGACC6CAACAGCCGCCTG6CCGTGC AGCTGCGGGACTGG 
MetGlyAspArgAsnSerArgLeuAlaValGlnljexiArgAspTxp 

991 GATGGCAACGCCGAGTTGCTGCAGTTCTCCGTGCACCTGGGTGGC 
AspGlyAsnAlaGluLeuLeuGlnPheSerValHlsIieuGlyGly 

1036 GAGGACACGGCCTATAGCCTGC AGCTCACTGCACCCGTGGCCGGC 
GlxiAspThrAlaTyrSerljeuGlnljeuThrAlaProValAlaGly 

1081 CAGCTGGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGTACCC 
GlnLeuGlyAlaThrThrValProProSerGlyljeuSerValPro 

1126 TTCTCCACTTGGGACCAGGATCACGACCTCCGCAGGGACAAGAAC 
PheSerThrTxpAspGlnAspHisAspLeuArgArgAspIiysAsn 

1171 TGCGCCAAGAGCCTCTCTGGAGGCTGGTGGTTTGGCACCTGCAGC 
Cy s Al aLy s S er LeuSer GlyGlyTrpTrpPheGlyThrCy s Ser 

1216 CATTCCAACCTCAACGGCCAGTACTTCCGCTCCATCCCACAGCAG 
HisSerAsnLeuAsnGlyGlnTyrPheArgSerXleProGlziGln 

1261 CGGCAGAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGC 
ArgGlnLysIjeuIjysIjysGlyXlePheTrpIjysThrTxpArgGly 

1306 CGCTACTACCCGCTGCAGGCCACCACCATGTTGATCCAGCCCATG 
ArgTyrTyrProLeuGlnAlaThrThrMetLeuIleGlnProMet 

1351 GCAGC AGAGGC AGCCTCCTAGCGTCCTGGCTGGGCCTGGTCCCAG 
AlaAlaGluAlaAlaSer 

1396 GCCCACGAAAGACGGTGACTCTTGGCTCTGCCCGAGGATGTGGCC 

1441 GTTCCCTGCCTGGGCAGGGGCTCCAAGGAGGGGCCATCTGGAAAC 

1486 TTGTGGACAGAGAAGAAGACCACGACTGGAGAAGCCCCCTTTCTG 

1531 AGTGCAGGGGGGCTGCATGCGTTGCCTCCTGAGATCGAGGCTGCA 

1576 GGATATGCTC AGACTCTAGAGGCGTGGACCAAGGGGCATGGAGCT 

1621 TCACTCCTTGCTGGCCAGGGAGTTGGGGACTCAGAG6GACCACTT 

1666 GGGGCCAGCCAGACTGGCCTCAATGGCGGACTCAGTCACATTGAC 

1711 TGACGGGGACCAGGGCTTGTGTGGGTCGAGAGCGCCCTCATGGTG 

1756 CTGGTGCTGTTGTGTGTAG6TCCCCTGGGGACACAAGCAGGCGCC 

1801 AATGGTATCTGGGC66AGCTCACAGAGTTCTTGGAATAAAAGCAA 

1846 CCTCAGAACA 



Fig. 3 (continued) 
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GGTAGCCGACGCGCCGGCCGGCGCGTGACCTTGCCCCTCTTGCTC 

GCCTTGAAAATGGAAAAGATGCTCGCAGGCTGCTTTCTGCTGATC 
Me t GluLy sMe t LeuAl aGlyCy s PheLeuLeuZ 1 e 

CTCGGACAGATCGTCCTCCTCCCTGCCGAGGCCAGGGAGCGGTCA 
LeuGlyGlnlleValLeuLeuProAlaGluAlaArgGluArgSer 

CGTGG6AGGTCCATCTCTAGGGGCAGACACGCTCGGACCCACCCG 
ArgGlyArgSerlleSerArgGlyArgHisAlaArgThrHisPro 

CAGACG6CCCTTCTGGAGAGTTCCTGTGAGAACAA6CG6GCAGAC 
GlxiThrAlaLeuLeuGluSerSerCysGluAsnljysArgAlaAsp 

CTGGTTTTCATCATT6ACAGCTCTCGCAGTGTCAACACCCATGAC 
LeuValPhellelleAspSerSerArgSerValAsziThrHisAsp 

TAT6CAAAGGTCAAGGAGTTCATCGTGGACATCTTGCAATTCTTG 
TyrAlaLysValLysGluPlielleValAspIlelieuGlnPhelieu 

GACATTGGTCCTGATGTCACCCGAGTGGGCCTGCTCCAATATGGC 
AspIleGlyProAspValThrArgValGlyLeuLeuGlnTyrGly 

AGCACTGTCAAGAATGAGTTCTCCCTCAAGACCTTCAAGAGGAAG 
SerThrValLysAsnGluPheSerlieuLysThrPheljysArgljys 

TCCGAGGTGGAGCGTGCTGTCAAGAGGATGCGGCATCTGTCCACG 
SerGluValGlTiArgAlaValljysArgMetArgHisIjeuSerThr 

GGCACCATGACTGGGCTGGCCATCCAGTATGCCCTGAACATCGCA 
GlyThrMetThrGlyLeuAlalleGlnTyrAlaLeuAsnlleAla 

TTCTCAGAAGCAGAGGGGGCCCGGCCCCTGAGGGAGAATGTGCCA 
PheSer61\iAla61uGlyAlcLArgProlje\iArgGl\iAsnValPro 

CGGGTCATAATGATCGTGACGGATGGGAGACCTCAGGACTCCGTG 
ArgVallleMetlleValThrAspGlyArgProGlnAspSerVal 

GCCGAGGTGGCTGCTAA6GCACGGGACACGGGCATCCTAATCTTT 
Al aGluVal Al aAl aLy s Al aAr gAspThr Gly I 1 eLeuI 1 ePhe 

GCCATTGGTGTGGGCCAGGTAGACTTCAACACCTTGAAGTCCATT 
AlalleGlyValGlyGlnValAspPlieAsnTlirlieuIiysSerlle 

GGGAGTGAGCCCCATGAGGACCATGTCTTCCTTGTGGCCAATTTC 
GlySerGluProHisGluAspHxsValPheljeuValAlaAsnPhe 

AGCCAGATTGAGACGCTGACCTCCGTGTTCCAGAAGAAGTTGTGC 
SerGlnlleGluThrlieuThrSerValPlieGlnliysIiysIieuCys 
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766 ACGGCCCACATGTGCAGCACCCTGGAGCATAACTGTGCCCACTTC 
ThrAlaHisMetCysSerThrlieuGluHisAsnCysAlaHisPhe 

811 TGCATCAACATCCCTGGCTCATACGTCT6CAGGTGCAAACAA6GC 
CysIleAsnlleProGlySerTyrValCysArgCysIjysGlnGly 

856 TACATTCTCAACTCGGATCAGACGACTTGCAGAATCCAGGATCTG 
TyrllelteuAsnSerAspGlnThrThrCysArgZleGlnAspIjeu 

901 TGTGCCATGGAGGACCACAACTGTGAGC AGCTCTGTGTGAATGTG 
CysAlaMetGluAspHisAsnCysGluGlnljeuCysValAsnVal 

946 CCGGGCTCCTTCGTCTGCGAGTGCTACAGTGGCTAC6CCCTGGCT 
PxroGlySer-PheValCysGluCysTyrSer-GlyTyrAlaliexiAla 

991 GAGGATGGGAAGAGGTGTGT6GCT6TGGACTACTGTGCCTCAGAA 
GluAspGlyLysArgCysValAlaValAspTyrCysAlaSerGlu 

1036 AACCACGGATGTGAAC ATGAGTGTGTAAATGCTGATGGCTCCTAC 
AsnHlsGlyCysGluHisGluCysValAsnAlaAspGlySerTyr 

1081 CTTTGCCAGTGCCATGAAGGATTTGCTCTTAACCCAGATGAAAAA 
LeuCysGinCysHisGluGlyPheAlaLexiAsnProAspGluLys 

112 6 ACGTGCACAAAGATAGACTACTGTGCCTCATCTAATCATGGATGT 
ThrCy sThrLy s I leAspTyrCy s AlaSer Ser AsnHi sGlyCy s 

1171 C AGTACGAGTGTGTTAAC ACAGATGATTCCTATTCCTGCC ACTGC 
GlnTyrGluCysValAsziThrAspAspSex'TyrSerCysHisCys 

1216 CTGAAAGGCTTTACCCTGAATCCAGATAAGAAAACCTGCAGAAGG 
LeuLysGlyPheThrLeuAsnProAspLysLysThrCysArgArg 

1261 ATCAACTACTGTGCACTGAACAAACCGGGCTGTGAGCATGAGTGC 
IleAsnTyrCysAlaLeuAsnLysProGlyCysGluHisGluCys 

1306 GTCAACATG6AGGAGAGCTACTACTGCCGCTGCC ACCGTGGCTAC 
ValAsnMetGluGluSerTyrTyrCysArgCysHisArgGlyTyr 

1351 ACTCTGGACCCC AATGGCAAACCCTGC A6CCGAGTG6ACCACT6T 
ThrLeuAspProAsnGlyLysProCysSerArgValAspHisCys 

1396 GCACA6CA66ACC ATGGCTGTGAGCAGCTGT6TCTGAAC ACGGAG 
AlaGlnGlnAspHlsGlyCysGluGlnLeuCysLeuLAsnThrGlu 

1441 GATTCCTTCGTCTGCCAGTGCTCAGAAGGCTTCCTCATCAACGAG 
AspSerPheValCysGlnCysSerGluGlyPhelieuIleAsnGlu 



Fig. 4 (continued) 
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1486 GACCTCAAGACCTGCTCCCGGGTGGATTACT6CCTGCTGAGTGAC 
AspLeuliysThrCysSerArgValAspTyrCysLeuIjeuSerAsp 

1531 CATGGTTGTGAATACTCCTGTGTCAAC ATGGAC AGATCCTTTGCC 
HisGlyCysGluTyrSerCysValAsnMetAspArgSerPheAla 

1576 TGTC AGTGTCCT6A6GGACACGTGCTCCGCAGCGATGGGAAGACG 
CysGlnCysProGluGlyHisVallieiiArgSerAspGlyliysThr 

1621 TGT6CAAAATTGGACTCTTGTGCTCTGGGGGACCACGGTTGTGAA 
CysAlaLysLeuAspSerCysAlaLeuGlyAspHisGlyCysGlu 

1666 CATTCGTGTGTAAGC AGTGAAGATTCGTTTGTGTGCC AGTGCTTT 
HisSerCysValSerSerGluAspSerPheValCysGlnCysPhe 

1711 GAAGGTTATATACTCCGTGAAGATGGAA7LJUVCCTGC AGAAGGAAA 
GluGlyTyrllelieuArgGluAspGlyljysThrCysArgArgliys 

1756 GATGTCTGCCAAGCTATAGACC ATG6CTGTGAACACATTTGTGTG 
AspValCysGlnAlaXleAspHisGlyCysGluHlsIleCysVal 

1801 AAC AGTGACGACTCATAC ACGTGCGAGTGCTTGGAGGGATTCCGG 
AsnSerAspAspSerTyrThrCysGluCysLeuGluGlyPheArg 

1846 CTC ACTGAGGATGGGAAACGCTGCCGAATTTCCTC AGGGAA6GAT 
LeuThrGlviAspGlyLysArgCysArglleSerSerGlyLysAsp 

1891 GTCTGCAAATC2^CCCACCATGGCT6CGAACACATTTGTGTTAAT 
ValCysLysSerThrHisHisGlyCysGluHisIleCysValAsn 

1936 AATGGGAATTCCTACATCTGC AAATGCTCAGAGGGATTTGTTCTA 
AsnGlyAsnSerTyrlleCysLysCysSerGluGlyPheValLeu 

1981 GCTGAGGACG6AAGACGGTGCAAGAAATGCACT6AAGGCCC AATT 
AlaGltiAspGlyArgArgCysLysLysCysThrGluGlyProIle 

2026 GACCTGGTCTTTGTGATCGATGGATCCAAGAGTCTTGGAGAAGAG 
AspLeuVa 1 PheVa 111 e AspGly S e r Ly s Se r LeuGlyGluGlu 

2071 AATTTTGAGGTCGTGAAGC AGTTTGTC ACTGGAATTATAGATTCC 
AsnPheGluValValLysGlnPheValThrGlyllelleAspSer 

2116 TTGAC AATTTCCCCC AAA6CCGCTCGAGTGGGGCTGCTCCAGTAT 
LeuThr X 1 e Ser Pr oLy s Al aAl aAr gVa 1 G lyLeuLeuGlnTy r 

2161 TCCACACAGGTCCACACAGAGTTCACTCTGAGAAACTTCAACTCA 
SerThrGlnValHisThrGluPheThrlieuArgAsnPheAsnSer 

2206 GCCAAAGACATGAA2UUUVGCCGTGGCCCACATGAAATACATGG6A 
AlaLysAspMetLysLysAlaValAlaHisMetLysTyrMetGly 



Fig. 4 (continued) 
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2251 AAGG6CTCTATGACTG6GCT66CCCTGAAACACAT6TTT6A6A6A 
Ly sGly S erMe t ThrGlyLeuAl aLeuLy sHi sMe t PheGlxiArg 

2296 AGTTTTACCC AAGGA6AAG6G6CCA6GCCCCTTTTCCAC AAGGGT 
SerPheThrGlnGlyGluGlyAlaArgProLeuPheHisLysGly 

2341 6CCCAGAGCAGCCATT6T6TTCACCGACGGACGGGCTCAGGATGA 
AlaGlnSerSerHlsCysValHisArgArgThirGlySerGly 

2386 C6TCTCC6AGTGG6CCAGTAAAGCCAAGGCCAAT66TATCACTAT 

2431 6TATGCTGTTG6GGTA6GAAAAGCCATTGA6GAGGAACTACAAGA 

247 6 GATTGCCTCTGA6CCCACAAACAAGC ATCTCTTCTATGCCGAAGA 

2521 CTTCAGCACAATGGATGAGATAAGTGAAAAACTCAAGAAAGGCAT 

2566 CTGTGAAGCTCTAGAAGACTCCGATGGAAGAC A6GACTCTCCAGC 

2611 AGGGGAACTGCCAAAAACGGTCCAACAGCCAACAGAATCTGAGCC 

2656 AGTCACCATAAATATCCAAGACCTACTTTCCTGTTCTAATTTTGC 

2701 AGTGCAAC AC AGATATCTGTTTGAAGAAGAC AATCTTTTACGGTC 

2746 TAG AC AAAAGCTTTCCC ATTC AACAAAACCTTC AGGAAGCCCTTT 

2791 GGAAGAAAAACACGATCAATGCAAATGTGAAAACCTTATAATGTT 

2836 CCAGAACCTTGCAAACGAAGAAGTAAGAAAATTTACACAGCGCTT 

2881 AGAAGAAATGACACAGAGAATGGAAGCCCTGGAAAATCGCCTGAG 

2926 ATACAGATGAAGATTAGAAATCGCGACACATTTGTAGTCATTGTA 

2971 TC ACGGATTAC AATGAACGCAGTGC AGAGCCCCAAAGCTCAGGCT 

3016 ATTGTTAAATC 



Fig. 4 (continued) 
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1 GGTAGCCGACGCGCCGGCCGGCGCGTGACCTTGCCCCTCTTGCTC 
4 6 6CCTTGAAAATGGAAAAGATGCTCGCAGGCT6CTTTCTGCTGATC 

MetGluLysMet^LeiiAlaGlyCysPheljeuLeuXle 

9 1 CTC6GACAGATCGTCCTCCTCCCCTGCGAGGCCAGGGAGCGGTCA 
LeuGlyGlnlleVallieuLeuProCysGluAlcmrgGluArgSer 

136 CGTGGGAGGTCCATCTCTAGGGGC AGAC ACGCTCGGACCC ACCCG 
ArgGlyArgSerlleSerArgGlyArgHisAlaArgThrHisPro 

181 C AGACGGCCCTTCTGGAGAGTTCCTGTGAGAAC AAGCGGGC AGAC 
GlnThrAlalieuIieuGluSerSerCysGluAsnLysArgAlaAsp 

226 CTGGTTTTCATCATTGAC AGCTCTCGC AGTGTCAAC ACCCATGAC 
LeuValPhelleXleAspSerSerArgSerValAsnTlirHisAsp 

271 TATGC AAAGGTC AAGGAGTTC ATCGTGGAC ATCTTGCAATTCTTG 
TyrAlaLysValLysGluPhelleValAspXleljeuGlnPheljeu 

316 GACATTGGTCCTGATGTC ACCCGAGTGGGCCTGCTCC AATATGGC 
AspIleGlyProAspValThrArgValGlyLeuLeuGlnTyrGly 

361 AGCACTGTCAAGAATGAGTTCTCCCTCAAGACCTTC AAGAGGAAG 
SerTlirValljysAsnGluPheSerLeuIjysThrPheljysArgljys 

406 TCCGAGGTGGAGCGTGCTGTCAAGAGGATGCGGCATCTGTCC ACG 
SerGluValGluArgAlaValLysArgMetArgHlsLeuSerThr 

451 GGCACCATGACTGGGCTGGCCATCCAGTATGCCCTGAACATCGCA 
GlyThrMetThrGlyLeuAlalleGlnTyrAlaLeuAsnlleAla 

496 TTCTCAGAAGCAGAGGGGGCCCGGCCCCTGAGGGAGAATGTGCCA 
PheSerGluAlaGluGlyAlaArgProLeuArgGluAsnValPro 

541 CGGGTC ATAATGATCGTGACGGATGGGAGACCTCAGGACTCCGTG 
ArgVallleMetlleValThrAspGlyArgProGlnAspSerVal 

586 GCCGAGGTGGCTGCTAAGGCACGGGAC ACGGGC ATCCTAATCTTT 
AlaGluValAlaAlaLysAlaArgAspThrGlyllelieuIlePlie 

631 GCCATTGGTGTGGGCC AGGTAGACTTCAACACCTTGAAGTCC ATT 
Al all eGly Val GlyG 1 nVa 1 AspPhe AsnThr LeuLy s S e rl 1 e 

676 GGGAGTGAGCCCCATGAGGACCATGTCTTCCTTGTGGCCAATTTC 
GlySerGluProHisGliiAspHisValPheLeuValAlaAsnPhe 

721 AGCCAGATTGAGACGCTGACCTCCGTGTTCCAGAAGAAGTTGTGC 
SerGlnlleGluThrLeuThrSerValPheGlnliysIjysLeuCys 



Fig. 5 
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7 66 ACGGCCCACATGTGCAGCACCCTGGAGCATAACTGTGCCCACTTC 
ThrAlaHisMetCysSerThrLeuGluHisAsnCysAlaHisPhe 

811 TGC ATC AAC ATCCCTGGCTC ATACGTCTGC AGGTGC AAAC AAGGC 
CysIleAsnlleProGlySerTyrValCysArgCysLysGlnGly 

856 TACATTCTCAACTCGGATCAGACGACTTGCAGAATCCAGGATCTG 
TyrXleLeuAsnSerAspGlnThrThrCysArgXleGlnAspLeu 

901 TGTGCC ATGGAGGACCACAACTGTGAGCAGCTCTGTGTGAATGTG 
CysAlaMetGliiAspHisAsnCys61u61nI«euCysValAsnVal 

946 CCGGGCTCCTTCGTCTGCGAGTGCTACAGTGGCTACGCCCTGGCT 
ProGlySerPheValCysGluCysTyrSerGlyTyrAlaLetiAla 

991 GAGGATGGGAAGAGGTGTGTGGCTGTGGACTACTGTGCCTCAGAA 
GlxiAspGlyLysArgCysValAlaValAspTyrCysAlaSerGlu 

1036 AACCACGGATGTGAACATGAGTGTGTAAATGCTGATGGCTCCTAC 
AsnHisGlyCysGluHisGluCysValAsnAlaAspGlySerTyr 

1081 CTTTGCC AGTGCC ATGAAGGATTTGCTCTTAACCC AGATGAAAAA 
Ij6uCysGlnCysHisGlu6lyPheAla£jeuAsnProAspGluLys 

1126 ACGTGC AC AAAGATAGACTACTGTGCCTC ATCTAATC ATGGATGT 
ThrCysThrLysIleAspTyrCysAlaSerSerAsnHisGlyCys 

1171 CAGTACGAGTGTGTTAACACAGATGATTCCTATTCCTGCCACTGC 
GlnTyrGluCysValAsnThrAspAspSerTyrSerCysHisCys 

1216 CTGAAAGGCTTTACCCTGAATCCAGATAAGAAAACCTGCAGAAGG 
LeuLysGlyPheThrLeuAsnProAspLysLysThrCysArgArg 

1261 ATCAACTACTGTGCACTGAACAAACC6GGCTGTGAGCATGAGTGC 
IleAsnTyrCysAlaLeuAsnliysProGlyCysGluHisGluCys 

1306 GTCAACATGGAGGAGAGCTACTACTGCCGCTGCCACCGTGGCTAC 
ValAsnMetGluGluSerTyrTyrCysArgCysHisArgGlyTyr 

1351 ACTCTGGACCCCAATGGCAAACCCTGCAGCCGAGTGGACCACTGT 
ThrLeuAspProAsnGlyLysProCysSerArgValAspHisCys 

1396 GCACAGCAGGACC ATGGCTGTGA6CA6CT6TGTCTGAACACGGAG 
AlaGlnGlnAspHisGlyCysGluGlnLeuCysLeiiAsnTlirGlu 

1441 GATTCCTTCGTCTGCCAGTGCTCAGAAGGCTTCCTCATCAACGAG 
AspSerPlieValCysGlnCysSerGluGlyPheLeuIleAsnGlu 

1486 GACCTCAAGACCTGCTCCCGGGTGGATTACTGCCTGCTGAGTGAC 
AspLeuLysThrCysSerArgValAspTyrCysLeuLeuSerAsp 

Fig. 5 (continued) 
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1531 CATGGTTGTGAATACTCCTGTGTCAACATGGACAGATCCTTTGCC 
HisGlyCysGluTyrSerCysValAsnMetAspArgSerPheAla 

1576 T6TCA6TGTCCT6A66GACAC6TGCTCCGCAGCGAT6GG2^6AC6 
CysGlnCysPro61uGlyHisValI<euArgSerAspGlyI.ysThr 

1621 TGTGCAAAATTGGACTCTTGTGCTCTGGGGGACCACGGTTGTGAA 
CysAlaLysLeviAspSerCysAlalieuGlyAspHisGlyCysGlu 

1666 CATTCGTGTGTAAGCAGTGAA6ATTCGTTTGTGTGCCA6TGCTTT 
HisSerCysValSerSerGluAspSerPheValCysGlnCysPhe 

1711 GAA6GTTATATACTCCGTGAAGAT66AAAAACCTGCA6AAGGAAA 
GluGlyTyrXlelieuArgGluAspGlyljysThrCysArgArgliys 

1756 GATGTCTGCCAAGCTATAGACCATGGCTGTGAACACATTTGTGT6 
AspValCy sGlnAlal leAspHl sGlyCy sGluHi s I leCy s Val 

1801 AACAGTGACGACTCATACACGTGCGA6TGCTTG6A666ATTCCG6 
AsnSerAspAspSerTyrThrCysGluCysLeuGluGlyPheArg 

1846 CTCACTGAGGAT6GGAAACGCT6CC6AATTTCCTCA6G6AA66AT 
LeuThrGluAspGlyLysArgCysArglleSerSerGlyljysAsp 

1891 GTCTGCAAATCAACCCACCATGGCTGCGAACACATTTGTGTTAAT 
ValCysLysSerThrHlsHlsGlyCysGluHlsIleCysValAsn 

1936 AAT6GGAATTCCTACATCTGCAAATGCTCAGAGGGATTTGTTCTA 
AsnGlyAsnSerTyrlleCysIiysCysSerGluGlyPheValLeu 

1981 GCTGA6GACGGAAGACGGTGCAAGAAATGC ACTGAAGGCCCAATT 
AlaGlxiAspGlyArgArgCysIiysItysCysThrGluGlyProIle 

2026 6ACCTGGTCTTT6TGATCGATGGATCCAAGAGTCTTGGAGAAGAG 
AspljeuValPheValJleAspGlySerl.ysSeri:jeuGlyGluGlu 

2071 AATTTTGAGGTCGTGAAGC AGTTTGTCACTGGAATTATAGATTCC 
AsnPheGluValValLysGlnPheValThrGlyllelleAspSer 

2116 TTGAC AATTTCCCCC AAAGCCGCTCGAGTGGGGCTGCTCCAGTAT 
LeuThrlleSerProIiysAlaAlaArgValGlylieuIieuGlnTyr 

2161 TCCAC ACAGGTCCAC AC AGAGTTC ACTCTGAGAAACTTC AACTC A 
S erThrGlnValHl sThrGluPheThr LexiArgAsnPheAsnSer 

2206 GCCAAAGAC ATGAAAAAAGCCGTGGCCCAC ATGAAATACATGGGA 
AlaLysAspMetliysIiysAlaValAlaHisMetLysTyrMetGly 
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2251 AAGGGCTCTATGACTGGGCTGGCCCTGAAAC ACATGTTTGAGAGA 
LysGlySerMetTlirGlyLeuAlaLeuIjysHisMetPheGluAxrg 

2296 AGTTTTACCCAAGGAGAAGGGGCCAG6CCCTTTTCCACAAGGGTG 
S er PheThr GlnGly GluGlyAl aAr gPr oPhe S er Thr Ar gVal 

2341 CCCAGAGCAGCCATTGTGTTC ACCGACGGACGGGCTC AGGATGAC 
Pr oAr gAl aAla 1 1 e Va 1 PheThr AspGlyArgAl aGlnAspAsp 

2386 GTCTCCGAGTGGGCC AGTAAAGCCAAGGCCAATGGTATC ACTATG 
ValSerGluTrpAlaSerljysAlaljysAlaAsnGlylleThrMet 

2431 TATGCTGTTGGGGTAGGAAAAGCCATTGAG6AGGAACTACAAGAG 
TyrAlaValGlyValGlyDysAlalleGluGluGluLeuGlnGlu 

2476 ATTGCCTCTGAGCCCACAAACAAGCATCTCTTCTATGCCGAAGAC 
I leAlaSerGluProThrAsnLy sHi sLeuPheTyr AlaGlxiAsp 

2521 TTCAGCACAATGGATGAGATAAGTGAAAAACTC AAGAAAGGC ATC 
PheSerThrMetAspGluIleSerGluLysLeuliysLysGlylle 

2566 TGTGAAGCTCTAGAAGACTCCGATGGAAGACAGGACTCTCCAGCA 
CysGluAlaLeuGluAspSerAspGlyArgGlnAspSerProAla 

2611 GGGGAACTGCCAAAAACGGTCCAACAGCCAACAGAATCTGAGCC A 
GlyGluLeuProIiysThrValGlnGlnProThrGluSerGluPx-o 

2656 GTCACCATAAATATCCAAGACCTACTTTCCTGTTCTAATTTTGCA 
ValThrl leAsnX leGlnAspIieuLeuSerCysSerAsnPheAla 

2701 GTGCAACACAGATATCTGTTTGAAGAAGAC AATCTTTTACGGTCT 
ValGlziHlsArgTyrljeuPheGluGluAspAsxiIjeuLeuArgSer 

2746 ACACAAAAGCTTTCCCATTCAACAAAACCTTCAGGAAGCCCTTTG 
ThrGlziLysLeuSerHisSerThrljysProSerGlySerProIjeu 

2791 GAAGAAAAAC ACGATC AATGC AAATGTGAAAACCTTATAATGTTC 
GluGluIiysHisAspGlnCysLysCysGluAsziIieuIleMetPhe 

2836 CAGAACCTTGCAAACGAAGAAGTAAGAAAATTAACACAGCGCTTA 
GlnAsnLetiAlaAsnGluGluValArgLyslieuThrGlnArglieu 

2881 GAAGAAATGACACAGAGAATGGAAGCCCTGGAAIU^TCGCCTGAGA 
GluGluMetThrGlnArgMetGluAlalieuGluAsnArgljeuArg 

2926 TACAGATGAAGATTAGAAATC6CGACACATTT6TA6TCATTGTAT 
TyrArg 



Fig. 5 (continued) 
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2971 CACGGATTACAATGAACGCAGTGCAGAGCCCCAAAGCTCAGGCTA 

3016 TTGTTAAATCAATAATGTTGT6AA6TAAAAC AATCA6TACT6AGA 

3061 AACCTGGTTTGCCAC A6AACA7VAGACAA6AAGTATAC ACTAACTT 

3106 GTATAAATTTATCTAGGAAAAAAATCCTTCAGAATTCTAAGATGA 

3151 ATTTACCAGGTGAGAATGAATAAGCTATGCAA6GTATTTTGTAAT 

3196 ATACTGTGGACACAACTTGCTTCTGCCTCATCCTGCCTTAGTGTG 

3241 CAATCTCATTTGACTATAC6ATAAAGTTT6CACAGTCTTACTTCT 

3286 GTAGAACACTGGCCATAGGAAATGCTGTTTTTTTGTACTGGACTT 

3331 TACCTTGATATATGTATATGGATGTATGCATAAAATCATAGGACA 

3376 TATGTACTTGTGGAACAAGTTGGATTTTTTATACAATATTAAAAT 

3421 TCACCACTTCAGAGAAAA6TAAAAAAA 



Fig. 5 (continued) 
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1 CGGCCCTTCTCACACTCCTGCCCTGCTGATGTGGAACGGGGTTTG 
4 6 G6GTTCTGCA66GCTATTGTCTGCGCTGGGGAAGGGGACAGGCCG 
9 1 GGACCGGGACCTCC6CTCGCAGCCGGCCGCACCAGCAGGACAGCT 

136 GGCCTGAAGCTCAGAGCCGGGGCGTGCGCCATGGCCCCAC ACTGG 



181 GCTGTCTGGCTGCTGGCAGCAAGGCTGTGGGGCCTGGGCATTGGG 
Al aVa iTrpIieuLeuAl aAl aAr gLeuTrpGly LeuGly 1 1 eGly 

226 GCTGAGGTGTGGTGGAACCTTGTGCCGCGTAAGACAGTGTCTTCT 
AlaGluValTrpTrpAsxiLeuValProArgLysThrValSerSer 

271 GGGGAGCTGGCC ACGGTAGTACGGCGGTTCTCCCAGACCGGC ATC 
GlyGluIieuAlaThrValValArgArgPheSerGlnThrGlylle 

316 C AGGACTTCCTGACACTGACGCTGACGGAGCCCACTGGGCTTCTG 
GlnAspPheIjeuTh.rIjeuTlirIieuTh.rGluProThrGlyIjeuIieu 

361 TACGTGGGCGCCCGAGAGGCCCTGTTTGCCTTCAGCATGGAGGCC 
TyrValGlyAlaAx-gGliiAlaLeuPlieAlaPheSerMetGlvLAla 

406 CTGGAGCTGCAAGGAGCGATCTCCT6GGAGGCCCCCGTGGAGAAG 
LeuGluLeuGlnGlyAlalleSerTrpGlxiAlaProValGluIjys 

451 AAGACTGAGTGTATCCAGAAAGGGAAGAACAACCAGACCGAGTGC 
LysThrGluCysIleGlnLysGlyLysAsnAsnGlnThrGluCys 

496 TTCAACTTCATCCGCTTCCTGCAGCCCTACAATGCCTCCCACCTG 
PheAsnPhelleArgPheLeuGlnProTyrAsnAlaSerHisIieu 

541 TACGTCTGTGGCACCTACGCCTTCCAGCCCAAGTGCACCTACGTC 
TyrValCysGlyThrTyrAlaPheGlnProLysCysThrTyrVal 

586 AAC ATGCTCACCTTCACTTTGGAGCATGGAGAGTTTGAAGATGGG 
AsnMetLeuThrPlieThrLeuGluHisGlyGluPheGluAspGly 

631 AAGGGCAAGTGTCCCTATGACCCAGCTAAGGGCCAT6CT6GCCTT 
LysGlyLysCysProTyrAspProAlaliysGlyHisAlaGlylieu 

676 CTTGTGGATGGTGAGCTGTACTCGGCCACACTCAACAACTTCCTG 
LeuValAspGlyGluLeuTyrSerAlaThrljeuAsnAsnPhelieu 

721 GGC ACGGAACCCATTATCCTGCGTAACATGGGGCCCCACCACTCC 
GlyThrGluProIlelleLeiiArgAsnMetGlyProHlsHisSer 



MetAlaProHisTzp 




:a. U y M-s-H ii& 6- m U c!' .1, ."L" O b;:^ 



23/35 



766 ATGAAGACAGAGTACCTGGCCTTTTGGCTCAACGAACCTCACTTT 
MetLysThrGluTyrLexiAlaPheTrpLeiiAsnGluProHisPhe 

811 GTAGGCTCTGCCTATGTACCTGAGAGGGTGGGCCTGCTGTGGACA 
ValGlySerAlaTyrValProGlxiArgValGlyLeuLeuTrpThr 

856 ATGGCATACTCTCTTCCAGCCCTA6GAGGAGGGCTCCTAACAGTG 
MetAlaTyrSerLeuProAlaLeuGlyGlyGlylieuLeuThrVal 

901 TAACTTATTGTGTCCCCGCGTATTTATTTGTTGTAAATATTTGAG 
946 TATTTTTATATTGACAAATAAA 



Fig. 6 (continued) 
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1 GGCACCA6GCCTTCCGGAGAGACGCAGTCGGCTGCCACCCCGGGA 

M 

4 6 TGGGTCGCTGGTGCCAGACCGTCGCGCGCGGGCAGCGCCCCCGGA 
etGlyArgTrpCysGlnThrValAlaArgGlyGlnArgProArgT 

91 CGTCTGCCCCCTCCC6CGCCGGTGCCCTGCTGCTGCTGCTTCTGT 
hrSerAlaProSerArgAlaGlyAlaLeuLeuIieuIieuLeuIjeuIj 

136 TGCTGAGGTCTGCAGGTTGCTGGGGCGCAG6G6AAGCCCCGGGGG 
euLeuArgSerAlaGlyCysTrpGlyAlaGlyGltiAlaProGlyA 

181 CGCTGTCCACTGCTGATCCCGCCGACCAGAGCGTCCAGTGTGTCC 
laLeuSerThrAlciAspProAlcLAspGlnSerValGlnCysValP 

226 CCAAGGCCACCTGTCCTTCCAGCCGGCCTCGCCTTCTCTGGCAGA 
roLysAlaThrCysProSerSerArgProArgLeuIjeuTrpGlnT 

271 CCCCGACCACCCAGACACTGCCCTCGACCACCATGGAGACCCAAT 
hrProThrThrGlnThrLeuProSerThrThrMetGluThrGlnP 

316 TCCC AGTTTCTGAAGGCAAAGTCGACCC ATACCGCTCCTGTGGCT 
heProValSerGluGlyljysValAspProTyrArgSerCysGlyP 

361 TTTCCTACGAGCAGGACCCCACCCTCAGGGACCC AGAAGCCGTGG 
heSerTyrGluGlnAspProThrLeuArgAspProGlxiAlaValA 

406 CTCGGCGGTGGCCCTGGATG6TCAGCGTGC6GGCCAATGGCACAC 
1 aAr gAr gTrpPr oTrpMe t Va 1 S e rVa 1 Ar gAl aAsnGlyThr H 

451 ACATCTGTGCCGGC ACCATCATTGCCTCCCAGTGGGTGCTGACTG 
isIleCysAlaGlyTlirllelleAlaSerGlnTrpValLeuThrV 

496 TGGCCCACTGCCTGATCTGGCGTGATGTTATCTACTCAGTGAGGG 
alAlaHisCysLeuIleTrpArgAspVallleTyrSerValArgV 

541 TGGGGAGTCCGTGGATTGACCAGATGACGCAGACCGCCTCCGATG 
alGlySerProTrpIleAspGlnMetThrGlnThrAlaSerAspV 

586 TCCCGGTGCTCCAGGTCATCATGCATAGCAGGTACCGGGCCCAGC 
alProVallieuGlnVallleMetHisSerArgTyrArgAlaGlnA 

631 GGTTCTGGTCCTGGGTGGGCCAGGCC AACGAC ATCGGCCTCCTCA 
rgPheTrpSerTrpValGlyGlnAlaAsnAspIleGlyLeuLeuI* 

67 6 AGCTCAAGCAGGAACTCAAGTACAGCAATTACGTGCGGCCCATCT 
ysLeuIiysGlnGluLeuLysTyrSerAsnTyrValArgProIleC 



Fig. 7 
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721 GCCTGCCTGGCACGGACTATGTGTTGAAGGACCATTCCCGCTGCA 
ysLeuProGlyThrAspTyrValLeuLysAspHisSerArgCysT 

766 CTGTGACGGGCTGGGGACTTTCCAAGGCTGACGGCATGTGGCCTC 
hrValThrGlyTrpGlyLeuSerLysAlaAspGlyMetTrpProG 

811 AGTTCCGGACC ATTCAGGAGAAGGAAGTCATC ATCCTGAACAAC A 
InPheArgThrlleGlnGluLysGluValllellelieuAsnAsnL 

856 AAGAGTGTGACAATTTCTACCACAACTTCACCAAAATCCCCACTC 
y sGluCy sAspAsnPheTyrHi s AsnPheTlirLy s 1 1 ePr oThr L 

901 TGGTTCAGATCATC AAGTCCC AGATGATGTGTGCGGAGGACACCC 
euValGlnllelleltysSerGlxiMetMetCysAlaGltiAspThrH 

946 ACAGGGAGAAGTTCTGCTATGAGCTAACTGGAGAGCCCTTGGTCT 
isArgGluLysPheCysTyrGluLeuThrGlyGluProIieuValC 

991 GCTCCATGGAGGGCACGTGGTACCTG6T6G6ATTGGTGAGCTGG6 
ysSerMetGluGlyThrTiTpTyrLeuValGlyLeuValSerTrpG 

1036 GTGCAGGCTGCCAGAAGAGCGAGGCCCCACCCATCTACCTACAGG 
lyAlaGlyCysGlnLysSerGluAlaProProIleTyrLeuGlnV 

1081 TCTCCTCCTACCAACACTGGATCTGGGACTGCCTCAACGGGC AGG 
alSerSerTyrGlnHisTrpIleTrpAspCysLeuAsnGlyGlnA 

1126 CCCTGGCCCTGCCAGCCCCATCCAGGACCCTGCTCCTGGCACTCC 
laLeuAlaLeuProAlaProSerArgThrLeuLeuIieuAlaljeuP 

1171 CACTGCCCCTCAGCCTCCTTGCTGCCCTCTGACTCTGTGTGCCCT 
roLeuProLeuSerlieuljeuAlaAlalieu 

1216 CCCTCACTTGTGGGCCCCCCTTGCCTCCGTGCCCAGGTTGCTGTG 

1261 GGTGCAGCTGTCACAGCCCTGAGAGTCAGGGTGGAGATGAGGTGC 

1306 TCAATTAAACATTACTGTTTTCCATGTAAAAAJUUU^AAAAAAAAA 

1351 AAAAAAAAA 
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CACCCCTCT(X:CTGCCCCAGCCCGCCCATCGCTTCCCCTTT6GAGCCTCCT6CTGGGCCACT6GCT^^ 
81 

AGTGATGGTAAGTGCTGGCCCAGACT6AAGCTC66AGAGGCACTCTGCTTGCCCA(X:GTCACAGTCmGCTC^ 
161 

TCCTGGCTTCCAGTCTCCCTTGCTTCCOVGATCCCAGACTCTAGCCCCAGCCCCGTCTCTTTC 
241 

TAC6CAATCTGC6CCTGCGTCTCATCAGTCGCCCCACAT6TAACTGTATCTACAACCA6CTGCACCAGCGACACCTGTCC 
321 

AACCCGGCCCGGCCTGGGATGCTATGTGGGGGCCCCCAGCCTGGGGTGCAGGGCCCCTGTCAGGTCTG^ 
401 

GAAGGAGCAGAAGGGGAGGGGCCTAACCCTGGGCTGGGGGTTGGACTCACAGGACTGGGGGAAAGAG^ 
481 

GTGTCTGCCATAGCTGGGCTCAGGCATCTGTCCTTGGCTmTTGCCTGGCTCCAGGGAGATTCC 
561 

TGCCTCGAGCCTGACGGACACTGGGTTCAGGCTGGOkTCATCAGCTTTGCATaU^ 
641 

GCTGCTGACCAACACAGCTGCTCACAGTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAGCTTTCCTGGCCCAGAG^ 
721 

A6ACCCCG6AGATGAGTGATGAGGACAGCTGTGTAGCCTGTGGATCCTT6AGGACAGCAGGTCCCCAGGCAGGAGCACCC 
MetSerAspGluAspSerCysValAlaCysGlySerLeuArgThrAlaGlyProGlnAlaGlyAlaPro 
801 

TCCCCATGGCCCT6GGAGGCCAGGCTGAT6CACCAGGGACAGCTGGCCT6TGGCGGAGCCCTG6TGTCAGAGGAGGCGGT 
SerProTrpProTrpGluAlaArgLeuMetHisGlnGlyGlnLeuAlaCysGlyGlyAlaLeuValSerGluGluAlaVa 
881 

GCTAACTGCTGCCCACTGCTTCATTGGGCGCCAGGCCCCAGAGGAATGGAGCGTAGGGCTGGG^ 
iLeuThrAlaAlaHisCysPhelleGlyArgGlnAlaProGluGluTrpSerValGlyLeuGlyThrArgProGluGluT 
961 

GGGGCCTGAAGCAGCTCATCCTGCATGGAGCCTAai(X:CACCCTGAGGGGGGCTAC6ACATGGCCCTCCTGCTGCT^ 
rpGlyLeuLysGlnLeuIlel^uHisGlyAlaTyrThrHisProGluGlyGlyTyrAspHetAlaLeiiLeuLeuLeuAla 
1041 

CAGCCTGTGACACTGGGAGCCAGCCTGCGGCCCCTCTGCCTGCCCTATGCTGACCACCACCTGCCTGATGGGGA^ 
GlnProValThrLeuGlyAlaSerLeuArgProLeuCysLeuProTyrAlaAspHisHisLeuProAspGlyGluArgGl 
1121 

CTGGGTTCTGGGACGGGCCCGCCCAGGAGCAGGCATCy^GCTCCCTCCAGAO^GTGCCCGTGACCCTCCTGG^ 
yTrpValLeuGlyArgAlaArgProGlyAlaGlylleSerSerLeuGlnThrValProValThrLeuLeuGlyProArgA 
1201 

CCTGCAGCCGGCTGCATGCAGCTCCT6GGGGTGATGGCAGCCCTATTCTGCCGGGGATGGTGTGTACCAGTGCTGTGGGT 
laCysSerArgLeuHisAlaAlaProGlyGlyAspGlySerProIleLeuProGlyMetValCysThrSerAlaValGly 
1281 

GAGCTGCCCAGCTGTGAGGTGAGCCCCA6GCCCCCACACCTTACCTAACAGGCCCCTGGCATCCCCTCACCCAATAGCTC 
GluLeuProSerCysGluValSerProArgProProHisLeuThr 
1361 

AAGAACGGACCTTCCAGGCTTGGCCTCTGGACCCACCTCCCACCTGAAGCTAAGCCTTT^ 
1441 

GCCAG 

Fig. 8 
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1 CTTAACAGCCACTTGTTTCATCCCACCTGGGCATTAGGTTGACTT 

4 6 CAAAGATGCCTCAGTTACTGCAAAACATTAATGGGATCATCGAGG 
MetProGlnlieuIjeuGlnAsnXleAsnGlyllelleGluA 

9 1 CCTTCAGGCGCTATGCAAGGACGGAGGGCAACTGCACAGCGCTCA 
laPheArgArgTyrAlaArgThrGluGlyAsnCysThrAlaLeuT 

136 CCCGAGGGGAGCTGAAAA6ACTCTTGGAGC AAGAGTTTGCCGATG 
hrArgGlyGluIieuLysArgLeuLeuGluGlnGluPheAlaAspV 

181 TGATTGTGAAACCCCACGATCC AGC AACTGTGGATGAGGTCCTGC 
allleValLysProHisAspProAlaThrValAspGluVallieiiA 

226 GTCTGCTGGATGAAGACC ACACAGGGACTGTGGAATTCAAGGAAT 
rgLeuljexiAspGlTiAspHisThrGlyThrValGluPheLysGluP 

271 TCCTGGTCTTAGTGTTTAAAGTTGCCC AGGCCTGTTTCAAGACAC 
heLeuValLeuValPheLysValAlaGliiAlaCysPheLysTlirIi 

316 TGAGC6AGAGTGCTGAGGGAGCCTGC6GCTCTCAAGAGTCTGGAA 
euSerGluSerAlaGluGlyAlaCysGlySerGliiGluSerGlyS 

361 GCCTCCACTCTGGGGCCTCGCAGGAGCTGGGCGAAGGACAGAGAA 
erLeuHisSerGlyAlaSerGlnGluLeuGlyGluGlyGliiArgS 

406 GTGGCACTGAAGTGGGAAGGGCGGGGAAAGGGCAGCATTATGAGG 
erGlyThrGluValGlyArgAlaGlyLysGlyGlnHisTyrGluG 

451 GGAGC AGCC ACAGAC AGAGCC AGCAGGGTTCC AGAGGGC AGAAC A 
lySerSerHisArgGlnSerGlnGlnGlySerArgGlyGlnAsnA 

496 GGCCTGGGGTTCAGACCCAGGGTCAGGCCACTGGCTCTGCGTGGG 
rgProGlyValGlnThrGlnGlyGlnAlaThrGlySerAlaTirpV 

541 TC AGC AGCTATGAC AGGC AAGCTGAGTCCCAGAGCCAGGAl^GAA 
alSerSerTyrAspArgGlnAlaGluSerGlnSerGlnGluArgI 

586 TAAGCCCGCAGATACAACTCTCTGGGC AGACAGAGCAGACCCAGA 
leSerProGlnlleGlnlieuSerGlyGlnThrGluGlnThrGlnlj 

631 AAGCTGGA6AA6GCAA6AG6AATCAGACAAC AGAGAT6AG6CCA6 
ysAlaGlyGluGlyLysArgAsnGlnThrThrGliiMetArgProG 

67 6 A6A6ACAGCCACA6ACCAGG6AACAGGACAGAGCCCACCA6ACAG 
luArgGlnProGlnThrArgGluGlnAspArgAlaHisGlnThrG 



Fig. 9 
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721 GTGAGACTGTGACTGGATCTGGAACTCAGACCCAGGCAGGTGCC A 
lyGluThrValThrGlySerGlyThrGlnThrGlnAlaGlyAlaT 

766 CCCAGACTGTGGAGCAGGACAGCAGCCACCAGACAGGAAGCACCA 
hrGlnThrValGluGlnAspSerSerHisGlnThrGlySerThrS 

811 GCACCCAGACAC AGGAGTCCACCAATGGCCAGAACAGAGGGACTG 
erThrGlnThrGlnGluSerThrAsnGlyGlnAsnArgGlyThrG 

856 AGATCC ACGGTC AAGGC AGGAGCC AGACC A6CC AGGCTGTGACAG 
luIleHisGlyGlnGlyArgSerGlnThrSerGlnAlaValThrG 

901 GAGGAC AC ACTC AGATAC AGGC AGGGTC ACACACCGAGACTGTGG 
lyGlyHisThrGlnlleGlnAlaGlySerHisThrGluThrValG 

946 AGC AGGAC AGAAGCC AAACTGTAAGCC ACGGAGGGGCTAGAGAAC 
luGlnAspArgSerGlnThrValSerHisGlyGlyAlaArgGluG 

991 AGGGAC AGACCCAGACGCAGCC AGGCAGTGGTCAAAGATGGATGC 
InGlyGlnThrGlnThrGlnProGlySerGlyGlnArgTrpMetG 

1036 AAGTGAGCAACCCTGAGGCAGGAGAGACAGTACCGGGAGGACAGG 
InValSerAsnProGlviAlaGlyGluThrValProGlyGlyGlnA 

1081 CCCAGACTGGGGC AAGC ACTGAGTCAGGAAGGC AGGAGTGGAGC A 
laGlnThrGlyAlaSerThrGluSerGlyArgGlnGluTrpSerS 

112 6 GCACTCACCCAAGGCGCTGTGTGACAGAAGGGCAGGGAGACAGAC 
erThrHisProArgArgCysValThrGluGlyGlnGlyAspArgG 

1 17 1 AGCCCAC AGTGGTTGGTGAGGAATG6GTTGATGACCACTCAAGGG 
InProThrValValGlyGluGluTrpValAspAspHisSerArgG 

1216 AGAC AGTGATCCTCAGGCTGGACCAGGGCAACTTGCATACCAGTG 
luTlirVal I lelieuArglieuAspGlnGlyAsnLeuHi sThrSerV 

1261 TTTCCTCAGCACAGGGCCAGGATGCAGCCCAGTC AGAAGAGAAGC 
alSerSerAlaGlnGlyGlnAspAlaAlaGlnSerGluGluIiysA 

1306 GAGGCATCACAGCTAGAGAGCTGTATTCCTACTTGAGAAGCACCA 
rgGlylleThrAlaArgGluLeuTyrSerTyrljeuArgSerThrL 

1351 AGCC ATGACTTCCCCGACTCCAATGTCCAGTACTGGAAGAAGACA 
ysPro 

1396 GCTGGAGAGA6TTTGGCTTGTCCTGCATGGCC AATCCAGTGGGTG 

1441 C ATCCCTGGAC ATCAGCTCTTCATTATGCAGCTTCCCTTTTAGGT 

1486 CTTTCTCAATGAGATAATTTCTGCAAGGAGCTTTCTATCCTGAAC 

1531 TCTTCTTTCTTACCTGCTTTGCGGTGCAGACCCTCTCAGGAGCAG 

1576 GAA6ACTCA6AACAAGTCACCCCTT 



Fig. 9 (continued) 
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Figure 15. Nucleotide Sequence for CG106318-01- 



>CG106318-01 4810 nt 

GTCCATGGGGCCGATGTATGGGAGATGAATGTGGTCCCGGAGGCATCCAAACGAGGGCTG 

TGTGGTGTGCTCATGTGGAGGGATGGACTACACTGCATACTAACTGTAAGCAGGCCGAGA 

GACCCAATAACCAGCAGAATTGTTTCAAAGTTTGCGATTGGCACAAAGAGTTGTACGACT 

GGAGACTGGGACCTTGGAATCAGTGTCAGCCCGTGATTTCAAAAAGCCTAGAGAAACCTC 

TTGAGTGCATTAAGGGGGAAGAAGGTATTCAGGTGAGGGAGATAGCGTGCATCCAGAAAG 

ACAAAGACATTCCTGCGGAGGATATCATCTGTGAGTACTTTGAGCCCAAGCCTCTCCTGG 

AGCAGGCTTGCCTCATTCCTTGCCAGCAAGATTGCATCGTGTCTGAATTTTCTGCCTGGT 

CCGAATGCTCCAAGACCTGCGGCAGCGGGCTCCAGCACCGGACGCGTCATGTGGTGGCGC 

CCCCGCAGTTCGGAGGCTCTGGCTGTCCAAACCTGACGGAGTTCCAGGTGTGCCAATCCA 

GTCCATGCGAGGCCGAGGAGCTCAGGTACAGCCTGCATGTGGGGCCCTGGAGCACCTGCT 

CAATGCCCCACTCCCGACAAGTAAGACAAGCAAGGAGACGCGGGAAGAATAAAGAACGGG 

AAAAGGACCGCAGCAAAGGAGTAAAGGATCCAGAAGCCCGCGAGCTTATTAAGAAAAAGA 

GAAACAGAAACAGGCAGAACAGACAAGAGAACAAATATTGGGACATCCAGATTGGATATC 

AGACCAGAGAGGTTATGTGCATTAACAAGACGGGGAAAGCTGCTGATTTAAGCTTTTGCC 

AGCAAGAGAAGCTTCCAATGACCTTCCAGTCCTGTGTGATCACCAAAGAGTGCCAGGTTT 

CCGAGTGGTCAGAGTGGAGCCCCTGCTCAAAAACATGCCATGACATGGTGTCCCCTGCAG 

GCACTCGTGTAAGGACACGAACCATCAGGCAGTTTCCCATTGGCAGTGAAAAGGAGTGTC 

CAGAATTTGAAGAAAAAGAACCCTGTTTGTCTCAAGGAGATGGAGTTGTCCCCTGTGCCA 

CGTATGGCTGGAGAACTACAGAGTGGACTGAGTGCCGTGTGGACCCTTTGCTCAGTCAGC 

AGGACAAGAGGCGCGGCAACCAGACGGCCCTCTGTGGAGGGGGCATCCAGACCCGAGAGG 

TGTACTGCGTGCAGGCCAACGAAAAGCTCCTCTCACAATTAAGTACCCACAAGAACAAAG 

AAGCCTCAAAGCCAATGGACTTAAAATTATGCACTGGACCTATCCCTAATACTACACAGG 

TGTGCCACATTCCTTGTCCAACTGAATGTGAAGTTTCACCTTGGTCAGCTTGGGGACCTT 

GTACTTATGAAAAGTGTAATGATCAGCAAGGGAAAAAAGGCTTCAAACTGAGGAAGCGGC 

GCATTACCAATGAGCCCACTGGAGGCTCTGGGGTAACCGGAAACTGCCCTCACTTACTGG 

AAGCCATTCCCTGTGAAGAGCCTGCCTGTTATGACTGGAAAGCGGTGAGACTGGGAGACT 

GCGAGCCAGATAACGGAAAGGAGTGTGGTCCAGGCACGCAAGTTCAAGAGGTTGTGTGCA 

TCAACAGTGATGGAGAAGAAGTTGACAGACAGCTGTGCAGAGATGCCATCTTCCCCATCC 

CTGTGGCCTGTGATGCCCCATGCCCGAAAGACTGTGTGCTCAGCACATGGTCTACGTGGT 

CCTCCTGCTCACACACCTGCTCAGGGAAAACGACAGAAGGGAAACAGATACGAGCACGAT 

CCATTCTGGCCTATGCGGGTGAAGAAGGTGGAATTCGCTGTCCAAATAGCAGTGCTTTGC 

AAGAAGTACGAAGCTGTAATGAGCATCCTTGCACAGTGTACCACTGGCAAACTGGTCCCT 

GGGGCCAGTGCATTGAGGACACCTCAGTATCGTCCTTCAACACAACTACGACTTGGAATG 

GGGAGGCCTCCTGCTCTGTCGGCATGCAGACAAGAAAAGTCATCTGTGTGCGAGTCAATG 

TGGGCCAAGTGGGACCCAAAAAATGTCCTGAAAGCCTTCGACCTGAAACTGTAAGGCCTT 

GTCTGCTTCCTTGTAAGAAGGACTGTATTGTGACCCCATATAGTGACTGGACATCATGCC 

CCTCTTCGTGTAAAGAAGGGGACTCCAGTATCAGGAAGCAGTCTAGGCATCGGGTCATCA 

TTCAGCTGCCAGCCAACGGGGGCCGAGACTGCACAGATCCCCTCTATGAAGAGAAGGCCT 

GTGAGGCACCTCAAGCGTGCGAAAGCTACAGGTGGAAGACTCACAAATGGCGCAGATGCC 

AATTAGTCCCTTGGAGGGTGCAACAAGACAGGCCTGGAGCACAGGAAGGCTGTGGGCCTG 

GGCGACAGGCAAGAGCCATTACTTGTCGCAAGCAAGATGGAGGACAGGCTGGAATCCATG 

AGTGCCTACAGTATGCAGGCCCTGTGCCAGCCCTTACCCAGGCCTGCCAGATCCCCTGCC 

AGGATGACTGTCAATTGACCAGCTGGTCCAAGTrrrCTTCATGCAATGGAGACTGTGGTG 

CAGTTAGGACCAGAAAGCGCACTCTTGTTGGAAAAAGTAAAAAGAAGGAAAAATGTAAAA 

ATrCCCATTTGTATCCCCTGATTGAGACTCAGTATTGTCCTTGTGACAAATATAATGCAC 

AACCTGTGGGGAACTGGTCAGACTGTATTTTACCAGAGGGAAAAGTGGAAGTGTTGCTGG 

GAATGAAAGTACAAGGAGACATCAAGGAATGCGGACAAGGATATCGTTACCAAGCAATGG 

CATGCTACGATCAAAATGGCAGGCTTGTGGAAACATCTAGATGTAACAGCCATGGTTACA 

TTGAGGAGGCGTGCATCATCCCCTGCCCCTCAGACTGCAAGCTCAGTGAGTGGTCCAACT 

GGTCGCGCTGCAGCAAGTCCTGTGGGAGTGGTGTGAAGGTTCGTTCTAAATGGCTGCGTG 

AAAAACCATATAATGGAGGAAGGGCTTGCCCCAAACTGGACCATGTCAACCAGGCACAGG 

TGTATGAGGTTGTCCCATGCCACAGTGACTGCAACCAGTACCTATGGGTCACAGAGCCCT 

GGAGCATCTGCAAGGTGACCTTTGTGAATATGCGGGAGAACTGTGGAGAGGGCGTGCAAA 

CCCGAAAAGTGAGATGCATGCAGAATACAGCAGATGGCCCTTCTGAACATGTAGAGGATT 

ACCTCTGTGACCCAGAAGAGATGCCCCTGGGCTCTAGAGTGTGCAAATTACCATGCCCTG 

AGGACTGTGTGATATCTGAATGGGGTCCATGGACCCAATGTGTTTTGCCTTGCAATCAAA 

GCAGTTTCCGGCAAAGGTCAGCTGATCCCATCAGACAACCAGCTGATGAAGGAAGATCTT 

GCCCTAATGCTGTTGAGAAAGAACCCTGTAACCTGAACAAAAACTGCTACCACTATGATT 

ATAATGTAACAGACTGGAGTACATGTCAGCTGAGTGAGAAGGCAGTTTGTGGAAATGGAA 

TAAAAACAAGGATGTTGGATTGTGTTCGAAGTGATGGCAAGTCAGTTGACCTGAAATATT 
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GTGAAGCGCTTGGCTTGGAGAAGAACTGGCAGATGAACACGTCCTGCATGGTGGAATGCC 

CTGTGAACTGTCAGCTTTCTGATTGGTCTCGTTGGTCAGAATGTTCTCAAACATGTGGCC 

TCACAGGAAAAATGATCCGAAGACGAACAGTGACCCAGCCCTTTCAAGGTGATGGAAGAC 

CATGCCCTTCCCTGATGGACCAGTCCAAACCCTGCCCAGTGAAGCCTTGTTATCGGTGGC 

AATATGGCCAGTGGTCTCCATGCCAAGTGCAGGAGGCCCAGTGTGGAGAAGGGACCAGAA 

CAAGGAACATTTCTTGTGTAGTAAGTGATGGGTCAGCTGATGATTTCAGCAAAGTGGTGG 

ATGAGGAATTCTGTGCTGACATTGAACTCATTATAGATGGTAATAAAAATATGGTTCTGG 

AGGAATCCTGCAGCCAGCCTTGCCCAGGTGACTGTTATTTGAAGGACTGGTCTTCCTGGA 

GCCTGTGTCAGCTGACCTGTGTGAATGGTGAGGATCTAGGCTTTGGTGGAATACAGGTCA 

GATCCAGACCGGTGATTATACAAGAACTAGAGAATCAGCATCTGTGCCCAGAGCAGATGT 

TAGAAACAAAATCATGTTATGATGGACAGTGCTATGAATATAAATGGATGGCCAGTGCTT 

GGAAGGGCTCTTCCCGAACAGTGTGGTGTCAAAGGTCAGATGGTATAAATGTAACAGGGG 

GCTGCTTGGTGATGAGCCAGCCTGATGCCGACAGGTCTTGTAACCCACCGTGTAGTCAAC 

CCCACTCGTACTGTAGCGAGACAAAAACATGCCATTGTGAAGAAGGGTACACTGAAGTCA 

TGTCTTCTAACAGCACCCTTGAGCAATGCACACTTATCCCCGTGGTGGTATTACCCACCA 

TGGAGGACAAAAGAGGAGATGTGAAAACCAGTCGGGCTGTACATCCAACCCAACCCTCCA 

GTAACCCAGCAGGACGGGGAAGGACCTGGTTTCTACAGCCATTTGGGCCAGATGGGAGAG 

TAAAGACCTGGGTTTACGGTGTAGCAGCTGGGGCATTTGTGTTACTCATCTTTATTGTCT 

CCATGATTTATCTAGCTTGCAAAAAGCCAAAGAAACCCCAAAGAAGGCAAAACAACCGAC 

TGAAACCTTTAACCTTAGCCTATGATGGAGATGCCGACATGTAACATATAACTTTTCCTG 

GCAACAACCA (SEQ ID NO: 40) 



Protein Sequence for CG1 0631 8-01 ORF Start: 18 0RFStop: 4782 Frame: 3 



Protein Sequence: 

>CG1 0631 8^1 -prot 1 588 aa 

MGDECGPGGIQTRAVWCAHVEGWTTLHTNCKQAERPNNQQNCFKVCDWHKELYDWRLGPW 

NQCQPVISKSLEKPLECIKGEEGIQVREIACIQKDKDIPAEDIICEYFEPKPLLEQACLI 

PCQQDCIVSEFSAWSECSKTCGSGLQHRTRHWAPPQFGGSGCPNLTEFQVCQSSPCEAE 

ELRYSLHVGPWSTCSMPHSRQVRQARRRGKNKEREKDRSKGVKDPEARELIKKKRNRNRQ 

NRQENKYWDIQIGYQTREVMCINKTGKAADLSFCQQEKLPMTFQSCVITKECQVSEWSEW 

SPCSKTCHDMVSPAGTRVRTRTIRQFPIGSEKECPEFEEKEPCLSQGDGWPCATYGWRT 

TEWTECRVDPLLSQQDKRRGNQTALCGGGIQTREVYCVQANENLLSQLSTHKNKEASKPM 

DLKLCTGPIPNTTQLCHIPCPTECEVSPWSAWGPCTYENCNDQQGKKGFKLRKRRITNEP 

TGGSGVTGNCPHLLEAIPCEEPACYDWKAVRLGDCEPDNGKECGPGTQVQEWCINSDGE 

EVDRQLCRDAIFPIPVACDAPCPKDCVLSTWSTWSSCSHTCSGKTTEGKQIRARSILAYA 

GEEGGIRCPNSSALQEVRSCNEHPCWYHWQTGPWGQCIEDTSVSSFNTTTTWNGEASCS 

VGMQTRKVICVRVNVGQVGPKKCPESLRPETVRPCLLPCKKDCIVTPYSDWTSCPSSCKE 

GDSSIRKQSRHRVIIQLPANGGRDCTDPLYEEKACEAPQACQSYRWKTHKWRRCQLVPWS 

VQQDSPGAQEGCGPGRQARAITCRKQDGGQAGIHECLQYAGPVPALTQACQIPCQDDCQL 

TSWSKFSSCNGDCGAVRTRKRTLVGKSKKKEKCKNSHLYPLIETQYCPCDKYNAQPVGNW 

SDCILPEGKVEVLLGMKVQGDIKECGQGYRYQAMACYDQNGRLVETSRCNSHGYIEEACI 

IPCPSDCKLSEWSNWSRCSKSCGSGVKVRSKWLREKPYNGGRPCPKLDHVNQAQVYEWP 

CHSDCNQYLWVTEPWSICKVTFVNMRENCGEGVQTRKVRCMQNTADGPSEHVEDYLCDPE 

EMPLGSRVCKLPCPEDCVISEWGPWTQCVLPCNQSSFRQRSADPIRQPADEGRSCPNAVE 

KEPCNLNKNCYHYDYNVTDWSTCQLSEKAVCGNGIKTRMLDCVRSDGKSVDLKYCEALGL 

EKNWQMNTSCMVECPVNCQLSDWSPWSECSQTCGLTGKMIRRRTVTQPFQGDGRPCPSLM 

DQSKPCPVKPCYRWQYGQWSPCQVQEAQCGEGTRTRNISCWSDGSADDFSKWDEEFCA 

DIELIIDGNKNMVLEESCSQPCPGDCYLKDWSSWSLCQLTCVNGEDLGFGGIQVRSRPVI 

IQELENQHLCPEQMLETKSCYDGQCYEYKWMASAWKGSSRTVWCQRSDGINVTGGCLVMS 

QPDADRSCNPPCSQPHSYCSETKTCHCEEGYTEVMSSNSTLEQCTLIPWVLPTMEDKRG 

DVKTSRAVHPTQPSSNPAGRGRTWFLQPFGPDGRLKTWVYGVAAGAFVLLIFIVSMIYLA 

CKKPKKPQRRQNNRLKPLTLAYDGDADM (SEQ ID NO: 41) 
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Figure 16. Nucleotide and Protein Sequences for CG50817-04. 



>CG5081 7^4 1447 nt 

GCGGACACCAGTGATGCTCCTGGGACCCTACGCAATCTGCGCCTGCGTCTCATCAGTCGC 

CCCACATGTAACTGTATCTACAACCAGCTGCACCAGCGACACCTGTCCAACCCGGCCCGG 

CCTGGGATGCTAT6TGGG6GCCCCCAGCCTGGGGTGCAGGGCCCCTGTCAGGTCTGATAG 

GGAGAAGAGAAGGAGCAGAAGGGGAGGGGCCTAACCCTGGGCTGGGGGTTGGACTCACAG 

GACTGGGGGAAAGAGCTGCAATCAGAGGGTGTCTGCCATAGCTGGGCTCAGGCATCTGTC 

CTTGGCTTTGTTGCCTGGCTCCAGGGAGATTCCGGGGGCCCTGTGCTGTGCCTCGAGCCT 

GACGGACACTGGGTTCAGGCTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGGAC 

GCTCCTGTGCTGCTGACCAACACAGCTGCTCACAGTTCCTGGCTGCAGGCTCGAGTTCAG 

GGGGCAGCTTTCCTGGCCCAGAGCCGAGAGACCCCGGAGATGAGTGATGAGGACAGCTGT 

GTAGCCTGTGGATCCTTGAGGAGAGCAGGTCCCCAGGCAGGAGCACCCTCCCCATGGCCC 

TGGGAGGCCAGGCTGATGCACCAGGGACAGCTGGCCTGTGGCGGAGCCCTGGTGTCAGAG 

GAGGCGGTGCTAACTGCTGCCCACTGCTTCATTGGGCGCCAGGCCCCAGAGGAATGGAGC 

GTAGGGCTGGGGACCAGACCGGAGGAGTGGGGCCTGAAGCAGCTCATCCTGCATGGAGCC 

TACACCCACCCTGAGGGGGGCTACGACATGGCCCTCCTGCTGCTGGCCCAGCCTGTGACA 

CTGGGAGCCAGCCTGCGGCCCCTCTGCCTGCCCTATGCTGACCACCACCTGCCTGATGGG 

GAGCGTGGCTGGGTTCTGGGACGGGCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGACA 

GTGCCCGTGACCCTCCTGGGGCCTAGGGCCTGCAGCCGGCTGCATGCAGCTCCT6GGGGT 

GATGGCAGCCCTATTCTGCCGGGGATGGTGTGTACCAGTGCTGTGGGTGAGCTGCCCAGC 

TGTGAGGCCAACCAACCAGCTGCTGACAGGGGACCTGGCCATTCTCAGGAACAAGAGAAT 

GCAGGCAGGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGTGATTCCAG 

GCACCAGGGCAGGCCCAGAAGCCCAGCAGCTGTGGGAAGGAACCTGCCTGGGGCCACAGG 

TGCCCACTCCCCACCCTGCAGGACAGGGGTGTCTGTGGACACTCCCACACCCAACTCTGC 

TACCAAGCAGGCGTCTCAGCTTTCCTCCTCCTTTACCCTTTCAGATACAATCACGCCAGC 

CACGTTGTTTTGAAAATTTCI I I I I I I GGGGGGCAGCAGTTTTCC I 1 11 I I l AAACTTAA 

ATAAATT (SEQ ID NO:42) 

Protein Sequence for CG50817-04ORF Start: 520 ORF Stop: 1192 Frame: 1 

Protein Sequence: 
>CG50817-04-prot 224 aa 

MSDEDSCVACGSLRTAGPQAGAPSPWPWEARLMHQGQLACGGALVSEEAVLTAAHCFIGR 
QAPEEWSVGLGTRPEEWGLKQLILHGAYTHPEGGYDMALLLLAQPVTLGASLRPLCLPYA 
DHHLPDGERGWVLGRARPGAGISSLQTVPVTLLGPRACSRLHAAPGGDGSPILPGMVCTS 
AVGELPSCEANQPAADRGPGHSQEQENAGRQMALLPLSSPPCHV (SEQ ID NO:43) 



Figure 17. Nucleotide and Protein Sequences for CG50817-05. 



. Nucleotide sequence encoding the Peptidase-like protein of the invention. 

>CG50817-05 

CGCTGGGCCTCTGTCCTG ATGCTGCTGAGCTCCCTGGTGTCTCTCGCTGGTTCTGTCTAC 60 
CTGGCCTGGATCCTGTTCTTCGTGCTCTATGATTTCTGCATTGTTTGTATCACCACCTAT 120 
GCTATCAACGTGAGCCTGATGTGGCTCAGTTTCCGGAAGGTCCAAGAACCCCAGGGCCAA 180 
CCCAAGCCTCAGGAGGGCAACACAGTCCCTGGCGAGTGGCCCTGGCAGGCCAGTGTGAGG 240 
AGGCAAGGAGCCCACATCTGCAGCGGCTCCCTGGTGGCAGACACCTGGGTCCTCACTGCT 300 
GCCCACTGCnTGAAAAGGCAGCAGCAACAGAACTGAATTCCTGCGTGAGGGACTCAGCC 360 
CCTGGGGCCGAAGAGGTGGGGGTGGCTGCCCTGCAGTTGCCCAGGGCCTATAACCACTAC 420 
AGCCAGGGCTCAGACCTGGCCCTGCTGCAGCTCGCCCACCCCACGACCCACACACCCCTC 480 
TGCCTGCCCCAGCCCGCCCATCGCTTCCCCTTTGGAGCCTCCTGCTGGGCCACTGGCTGG 540 
GATCAGGACACCAGTGATGCTCCTGGGACCCTACGCAATCTGeGCCTGCGTCTCATCAGT 600 
CGCCCCACATGTAACTGTATCTACAACCAGCTGCACCAGCGACACCTGTCCAACCCGGCC 660 
CGGCCTGGGATGCTATGTGGGGGCCCCCAGCCTGGGGTGCAGGGCCCCTGTCAGGGAGAT 720 
TCCGGGGGCCCTGTGGTGTGCCTCGAGCCTGACGGACACTGGGTTCAGGCTGGCATCATC 780 
AGCTTTGCATCAAGCTGTGCCCAGGAGGACGCTCCTGTGCTGCTGACCAACACAGCTGCT 840 
CACAGTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAGCTTTCCTGGCCCAGAGCCCAGAG 900 
ACCCCGGAGATGAGTGATGAGGACAGCTGTGTAGCCTGTGGATCCTTGAGGACAGCAGGT 960 
CCCCAGGCAGGAGCACCCTCCCCATGGCCCTGGGAGGCCAGGCTGATGCACCAGGGACAG 1020 
CTGGCCTGTGGCGGAGCCCTGGTGTCAGAGGAGGCGGTGCTAACTGCTGCCCACTGCTTC 1080 
ATTGGGCGCCAGGCCCCAGAGGAATGGAGCGTAGGGCTGGGGACCAGACCGGAGGAGTGG 1 1 40 
GGCCTGAAGCAGCTCATCCTGCATGGAGCCTACACCCACCCTGAGGGGGGCTACGACATG 1200 
GCCCTCCTGCTGCTGGCCCAGCCTGTGACACTGGGAGCCAGCCTGCGGCCCCTCTGCCTG 1260 
CCCTATGCTGACCACCACCTGCCTGATGGGGAGCGTGGCTGGGTTCTGGGACGGGCCCGC 1320 
CCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCGTGACCCTCCTGGGGCCTAGGGCC 1380 
TGCAGCCGGCTGCATGCAGCTCCTGGGGGTGATGGCAGCCCTATTCTGCCGGGGATGGTG 1440 
TGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGGCCAACCAACCAGCTGCTGACAGG 1500 
GGACCTGGCCATTCTCAGGAACAAGAGAATGCAGGCAGGCAAATGGCATTACTGCCCCTG 1560 
TCCTCCCCACCCTGTCATGTGTG ATTCCAGGC 1 592 

(SEQ ID NO:44) 



Protein sequence encoded by the coding sequence shown above. 

>CG50817-05 

MLLSSLVSLAGSVYLAWILFFVLYDFCIVCITTYAINVSLMWLSFRKVQEPQGQPKPQEG 60 
NTVPGEWPWQASVRRQGAHICSGSLVADTWVLTAAHCFEKAAATELNSCVRDSAPGAEEV 1 20 
GVAALQLPRAYNHYSQGSDLALLQLAHPTTHTPLCLPQPAHRFPFGASCWATGWDQDTSD 180 
APGTLRNLRLRLISRPTCNCIYNQLHQRHLSNPARPGMLCGGPQPGVQGPCQGDSGGPVL 240 
CLEPDGHWVQAGIISFASSCAQEDAPVLLTNTAAHSSWLQARVQGAAFLAQSPETPEMSD 300 
EDSCVACGSLRTAGPQAGAPSPWPWEARLMHQGQLACGGALVSEEAVLTAAHCFIGRQAP 360 
EEWSVGLGTRPEEWGLKQLILHGAYTHPEGGYDMALLLLAQPVTLGASLRPLCLPYADHH 420 
LPDGERGWVLGRARPGAGISSLQTVPVTLLGPRACSRLHAAPGGDGSPILPGMVCTSAVG 480 
ELPSCEANQPAADRGPGHSQEQENAGRQMALLPLSSPPCHV 521 

(SEQ ID NO:45) 
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Figure 18. Nucleotide and Protein Sequences for CG50817-06. 



Nucleotide sequence encoding the Peptidase-lilce protein of the invention. 

>CG50817-06 

AGCGACACCTGTCCAACCCGGCCCGGCCTGGG ATGCTATGTGGGGGCCCCCAGCCTGGGG 60 
TGCAGGGCCCCTGTCAGGGAGATTCCGGGGGCCCTGTGCTGTGCCTCGAGCCTGACGGAC 120 
ACTGGGTTCAGGCTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGGACGCTCCTG 180 
TGCTGCTGACCAACACAGCTGCTCACAGTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAG 240 
CTTTCCTGGCCCAGAGCCCAGAGACCCCGGAGATGAGTGATGAGGACAGCTGTGTAGCCT 300 
GTGGATCCTTGAGGACAGCAGGTCCCCAGGCAGGAGCACCCTCCCCATGGCCCTGGGAGG 360 
CCAGGCTGATGCACCAGGGACAGCTGGCCTGTGGCGGAGCCCTGGTGTCAGAGGAGGCGG 420 
TGCTAACTGGTGCCCACTGCTTCATTGGGCGCCAGGCCCCAGAGGAATGGAGCGTAGGGC. 480 
TGGGGACCAGACCGGAGGAGTGGGGCCTGAAGCAGCTCATCCTGCATGGAGCCTACACCC 540 
ACCCTGAGGGGGGCTACGACATGGCCCTCCTGCTGCTGGCCCAGCCTGTGACACTGGGAG 600 
CCAGCCTGCGGCCCCTCTGCCTGCCCTATGCTGACCACCACCTGCCTGATGGGGAGCGTG 660 
GGTGGGTTCTGGGACGGGCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCG 720 
TGACCCTCCTGGGGCCTAGGGCCTGCAGCCGGCTGCATGCAGCTCCTGGGGGTGATGGCA 780 
GCCCTATTCTGCCGGGGATGGTGTGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGG 840 
CCAACCAACCAGCTGCTGACAGGGGACGTGGCCATTCTCAGGAACAAGAGAATGCAGGCA 900 
GGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGTG ATTCCAGGCACCAG 960 
GGCAGGCCCAGAAGCCCAGCAGCTGTGGGAAGGAACCTGCCTGGGGCCACAGGTGCCCAC 1020 
TCCCCACCCTGCAGGACAGGGGTGTCTGTGGACACTCCCACACCCAACTCTGCTACCAAG 1080 
CAGGCGTCTCAGCTTTCCTCCTCCTTTACCCTTTCAGATACAATCACGCCAGCCACGTTG 1140 
TTTTGAAAATTTCI H I H I GGGGGGCAGCAGTTTTCC I I t I I [ f AAACTTAAATAAATT 1200 

(SEQ ID NO:46) 



Protein sequence encoded by the coding sequence shown above. 

>CG50817-06 

MLCGGPQPGVQGPCQGDSGGPVLCLEPDGHWVQAGIISFASSCAQEDAPVLLTNTAAHSS 60 
WLQARVQGAAFLAQSPETPEMSDEDSCVACGSLRTAGPQAGAPSPWPWEARLMHQGQLAC 1 20 
GGALVSEEAVLTAAHCFIGRQAPEEWSVGLGTRPEEWGLKQLILHGAYTHPEGGYDMALL 180 
LLAQPVTLGASLRPLCLPYADHHLPDGERGWVLGRARPGAGISSLQTVPVTLLGPRACSR 240 
LHAAPGGDGSPILPGMVCTSAVGELPSCEANQPAADRGPGHSQEQENAGRQMALLPLSSP 300 
PCHV 304 
(SEQ ID N0:47) 
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Figure 19. Nucleotide and Protein Sequences For CG51099-03. 



Nucleotide sequence encoding the Serine Protease-like protein of the invention. 



>CG51099-03 

CGGAGAGACGCAGTCGGCTGCCACCCCGGG ATGGGTCGCTGGTGCCAGACCGTCGCGCGC 6 0 

GGGCAGCGCCCCCGGACGTCTGCCCCCTCCCGCGCCGGTGCCCTGCTGCTGCTGCTTCTG 120 

TTGCTGAGGTCTGCAGGTTGCTGGGGCGCAGGGGAAGCCCCGGGGGCGCTGTCCACTGCT 180 

GATCCCGCCGACCAGAGCGTCCAGTGTGTCCCCAAGGCCACCTGTCCTTCCAGCCGGCCT 2 4 0 

CGCCTTCTCTGGCAGACCCCGACCACCCAGACACTGCCCTCGACCACCATGGAGACCCAA 3 00 

TTCCCAGTTTCTGAAGGCAAAGTCGACCCATACCGCTCCTGTGGCTTTTCCTACGAGCAG 3 60 

GACCCCACCCTCAGGGACCCAG7U!^GCCGTGGCTCGGCGGTGGCCCTGGATGGTCAGCGTG 42 0 

CGGGCCAATGGCACACACATCTGTGCCGGCACCATCATTGCCTCCCAGTGGGTGCTGACT 4 80 

GTGGCCCACTGCCTGATCTGGCGTGATGTTATCTACTCAGTGAGGGTGGGGAGTCCGTGG 54 0 

ATTGACCAGATGACGCAGACCGCCTCCGATGTCCCGGTGCTCCAGGTCATCATGCATAGC 600 

AGGTACCGGGCCCAGCGGTTCTGGTCCTGGGTGGGCCAGGCCAACGACATCGGCCTCCTC 66 0 

AAGCTCAAGCAGGAACTCAAGTACAGCAATTACGTGCGGCCCATCTGCCTGCCTGGCACG 72 0 

GACTATGTGTTGAAGGACCATTCCCGCTGCACTGTGACGGGCTGGGGACTTTCCAAGGCT 780 

GACGGC ATGTGGCCTCAGTTCCGGACCATTCAGGAGAAGGAAGTCATCATCCTGAACAAC 840 

AAAGAGTGTGACAATTTCTACCACAACTTCACCAAAATCCCCACTCTGGTTCAGATCATC 900 

AAGTCCCAGATGATGTGTGCGGAGGACACCCACAGGGAGAAGTTCTGCTATGAGCTAACT 960 

GGAGAGCCCTTGGTCTGCTCCATGGAGGGCACGTGGTACCTGGTGGGATTGGTGAGCTGG 102 0 

GGTGCAGGCTGCCAGAAGAGCGAGGCCCCACCCATCTACCTACAGGTCTCCTCCTACCAA 10 8 0 

CACTGGATCTGGGACTGCCTCAACGGGCAGGCCCTGGCCCTGCCAGCCCCATCCAGGACC 114 0 

CTGCTCCTGGCACTCCCACTGCCCCTCAGCCTCCTTGCTGCCCTCTG ACTCTGTGTGCCC 1200 

TCCCTCACTTGTGA 1214 
(SEQ ID NO:48) 

Protein sequence encoded by the nucleotide sequence shown above. 



>CG51099-03 

MGRWCQTVARGQRPRTSAPSRAGALLLLLLLLRSAGCWGAGEAPGALSTADPADQSVQCV 60 
PKATCPSSRPRLLWQTPTTQTLPSTTMETQFPVSEGKVDPYRSCGFSYEQDPTLRDPEAV 1 20 
ARRWPWMVSVRANGTHICAGTIIASQWVLTVAHCLIWRDVIYSVRVGSPWIDQMTQTASD 1 80 
VPVLQVIMHSRYRAQRFWSWVGQANDIGLLKLKQELKYSNYVRPICLPGTDYVLKDHSRC 240 
TVTGWGLSKADGMWPQFRTIQEKEVIILNNKECDNFYHNFTKIPTLVQIIKSQMMCAEDT 300 
HREKFCYELTGEPLVCSMEGTWYLVGLVSWGAGCQKSEAPPIYLQVSSYQHWIWDCLNGQ 360 
ALALPAPSRTLLLALPLPLSLLAAL 385 (SEQ ID NO:49) 
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Figure 20. Nucleotide and Protein Sequences For CG57051-04, 



Nucleotide sequence encoding the Angiopoietin-like protein, CG57051-04. 



>CG57051-04 

TGCGGATCCTCACACGACTGTGATCCGATTCTTTCCAGCGGCTTCTGCAACCAAGCGGGT 60 
CTTACCCCCGGTCCTCCGCGTCTCCAGTCCTCGCACCTGGAACCCCAACGTCCCCGAGAG 120 
TCCCCGAATCCCCGCTCCCAGGCTACCTAAGAGGA TGAGCGGTGCTCCGACGGCCGGGGC 180 
AGGCCTGATGCTCTGCGCCGCCACCGCCGTGCTACTGAGCGCTAGATCTGGACCCGTGCA 240 
GTCCAAGTCGCCGCGCTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGCACGGACTCCT 300 
GCAGCTCGGCCAGGGGCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCTGAGCGCGCT 360 
GGAGCGGCGCCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTCCACCGACCT 420 
CCCGTTAGCCCCTGAGAGCCGGGTGGACCCTGAGGTCCTTCACAGCCTGCAGACACAACT 480 
CAAGGCTCAGAACAGCAGGATCCAGCAACTCTTCCACAAGGTGGCCCAGCAGCAGCGGCA 540 
CCTGGAGAAGCAGCACCTGCGAATTCAGCATCTGCAAAGCCAGTTTGGCCTCCTGGACCA 600 
CAAGCACCTAGACCATGAGGTGGCCAAGCCTGCCCGAAGAAAGAGGCTGCCCGAGATGGC 660 
CCAGCCAGTTGACCCGGGTCACAATGTCAGCCGCCTGCACCGAGGCTGGTGGTTTGGCAC 720 
CTGCAGCCATTCCAACCTCAACGGCCAGTACTTCCGCTCCATCCCACAGCAGCGGCAGAA 780 
GCTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGCCGCTACTACCCGCTGCAGGCCAC 840 
CACCATGTTGATCCAGCCCATGGCAGCAGAGGCAGCCTCCTAG CGTCCTGGCTGGGCCTG 900 
GTCCCAGGCCCACGAAAGACGGTGACTCTTGGCTCTG 937 (SEQ ID NO:50) 

Protein sequence encoded by the nucleotide sequence shown above. 



>CG57051-04 

MSGAPTAGAALMLCAATAVLLSARSGPVQSKSPRFASWDEMNVLAHGLLQLGQGLREHAE 60 
RTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLF 120 
HKVAQQQRHLEKQHLRIQHLQSQFGLLDHKHLDHEVAKPARRKRLPEMAQPVDPAHNVSR 180 
LHRGWWFGTCSHSNLNGQYFRSIPQQRQKLKKGIFWKTWRGRYYPLQATTMLIQPMAAEA 240 
AS 242 (SEQIDNO:51) 



Figure 21. Nucleotide and Protein Sequences For CG57051-05. 

Nucleotide sequence encoding the Angiopoietin-like protein, CG57051-05. 



>CG57051-05 

CTTCGTCTCCAGTCCTCGCACCTGGAACCCCAACGTCCCCGAGAGTCCCCGAATCCCCGC 60 

TCCCAGGCTACCTAAGAGG ATGAGCGGCGCTCCGACGGCCGGGGCAGCCCTGATGCTCTG 12 0 

CGCCGCCACCGCCGTGCTACTGAGCGCTCAGGGCGGACCCGTGCAGTCCAAGTCGCCGCG 180 

CTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGCACGGACTCCTGCAGCTCGGCCAGGG 24 0 

GCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCTGAGCGCGCTGGAGCGGCGCCTGAG 300 

CGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTCCACCGACCTCCCGTTAGCCCCTGA 360 

GAGCCGGGTGGACCCTGAGGTCCTTCACAGCCTGCAGACACAACTCAAGGCTCAGAACAG 420 

CAGGATCCAGCAACTCTTCCACAAGGTGGCCCAGCAGCAGCGGCACCTGGAGAAGCAGCA 4 80 

CCTGCGAATTCAGCATCTGCAAAGCCAGTTTGGCCTCCTGGACCACAAGCACCTAGACCA 54 0 

TGAGGGTGGCAAGCCTGCCCGAAGAAAGAGGCTGCCCGAGATGGCCCAGCCAGTTGACCC 600 

GGCTCACT^TGTCAGCCGCCTGCACCATGGAGGCTGGACAGTAATTCAGAGGCGCCT^CGA 660 

TGGCTCAGTGGACTTCAACCGGCCCTGGGAAGCCTACAAGGCGGGGTTTGGGGATCCCCA 720 

CGGCGAGTTCTGGCTGGGTCTGGAGAAGGTGCATAGCATCATGGGGGACCGCAACAGCCG 780 

CCTGGCCGTGCAGCTGCGGGACTGGGATGGCAACGCCGAGTTGCTGCAGTTCTCCGTGCA 840 

CCTGGGTGGCGAGGACACGGCCTATAGCCTGCAGCTCACTGCACCCGTGGCCGGCCAGCT 900 

GGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGTACCCTTCTCCACTTGGGACCAGGA 960 

TCACGACCTCCGCAGGGACAAGAACTGCGCCAAGAGCCTCTCTGGAGGCTGGTGGTTTGG 1020 

CACCTGCAGCCATTCCAACCTCAACGGCCAGTACTTCCGCTCCATCCCACAGC AGCGGCA 1080 

GAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGCCGCTACTACCCGCTGCAGGC 114 0 

CACCACCATGTTGATCCAGCCCATGGCAGCAGAGGCAGCCTCCTA GCGTCCTGGCTGGGC 1200 
CTGGTCCCAGGCCCACGAAAGAGGTGACTCTTGGCTCTG 1239 (SEQ ID NO: 52) 

Protein sequence for Angiopoietin-like protein, CG57051-05. 



>CGS7051-05 

MSGAPTAGAALMLCAATAVLLSAQGGPVQSKSPRFASWDEMNVIiAHGLLQLGQGLREHAE 60 

RTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLF 120 

HKVAQQQRHLEKQHLRIQHLQSQFGLLDHKHLDHEGGKPARRKRLPEMAQPVDPAHNVSR 180 

LHHGGWTVIQRRHDGSVDFNRPWEAYKAGFGDPHGEFWLGLEKVHSIMGDRNSRLAVQLR 240 

DWDGNAELLQFSVHLGGEDTAYSLQLTAPVAGQLGATTVPPSGLSVPFSTWDQDHDLRRD 300 

KNCAKSLSGGWWFGTCSHSNLNGQYFRS I PQQRQKLKKGI FWKTWRGRYYPLQATTMLIQ 360 
PMAAEAAS 368 (SEQ ID NO: 53) 
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Figure 22. Nucleotide and Protein Sequences For CG57051-02. 

Nucleotide sequence encoding tiie Angiopoietin-like protein of the inyention. 



>CG57051__02 

TGCGGATCCTCACACGACTGTGATCCGATTCTTTCCAGCGGCTTCTGCAACCAAGCGGGT 60 

CTTACCCCCGGTCCTCCGCGTCTCCAGTCCTCGCACCTGGAACCCCAACGTCCCCGAGAG 12 0 

TCCCCGAATCCCCGCTCCCAGGCTACCTAAGAGG ATGAGCGGTGCTCCGACGGCCGGGGC 180 

AGCCCTGATGCTCTGCGCCGCCACCGCCGTGCTACTGAGCGCTAGATCTGGACCCGTGCA 24 0 

GTCCAAGTCGCCGCGCTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGCACGGACTCCT 3 00 

GCAGCTCGGCCAGGGGCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCTGAGCGCGCT 360 

GGAGCGGCGCCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTCCACCGACCT 42 0 

CCCGTTAGCCCCTGAGAGCCGGGTGGACCCTGAGGTCCTTCACAGCCTGCAGACACAACT 4 80 

CAAGGCTCAGAACAGCAGGATCCAGCAACTCTTCCACAAGGTGGCCCAGCAGCAGCGGCA 54 0 

CCTGGAGAAGCAGCACCTGCGAATTCAGCATCTGCAAAGCCAGTTTGGCCTCCTGGACCA 600 

CAAGCACCTAGACCATGAGGTGGCCAAACCTGCCCGAAGAAAGAGGCTGCCCGAGATGGC 660 

CCAGCC AGTTG ACCCGGCTCACAATGTCAGCCGCCTGCACCATGGAGGCTGGACAGTAAT 720 

TCAGAGGCGCCACGATGGCTCAATGGACTTCAACCGGCCCTGGGAAGCCTACAAGGCGGG 780 

GTTTGGGGATCCCCACGGCGAGTTCTGGCTGGGTCTGGAGAAGGTGCATAGCATCACGGG 840 

GGACCGCAAC7VGCCGCCTGGCCGTGCAGCTGCGGGACTGGGATGGCAACGCCGAGTTGCT 900 

GCAGTTCTCCGTGC7VCCTGGGTGGCGAGGACACGGCCTATAGCCTGCAGCTCACTGC ACC 960 

CGTGGCCGGCCAGCTGGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGTACCCTTCTC 1020 

CACTTGGGACCAGGATCACGACCTCCGCAGGGACAAGAACTGCGCCAAGAGCCTCTCTGC 1080 

CCCATCGGTGGCTCAAAGACCTGACCATGTTCCCTCTCCCCTGACCCCGGCAGGAGGCTG 114 0 

GTGGTTTGGCACCTGCAGCCATTCC AACCTCAACGGCCAGTACTTCCGCTCC ATCCCACA 1200 

GCAGCGGCAGAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGCCGCT ACTACCC 1260 
GCTGCAGGCCACCACCATGTTGATCCAGCCCATGGCAGCAGAGGCAGCCTCCT AG 1315 
(SEQ ID NO: 54) 



Protein sequence for CG57051-02. 

>CG57051_02 

MSGAPTAGAALMLCAATAVLLSARSGPVQSKSPRFASWDEMNVIiAHGLLQLGQGLREHAE 60 

RTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLF 12 0 

HKVAQQQRHLEKQHLRIQHLQSQFGLLDHKHLDHEVAKPARRKRLPEMAQPVDPAHNVSR 180 

liHHGGWTVIQRRHDGSMDFNRPWEAYKAGFGDPHGEFWLGLEKVHSITGDRNSRIiAVQLR 24 0 

DWDGNAELLQFS VHLGGEDTAYSLQLTAPVAGQLGATTVPPSGLSVPFSTWDQDHDLRRD 300 

KNCAKSLSAPSVAQRPDHVPSPLTPAGGWWFGTCSHSNLNGQYFRSIPQQRQKLKKGIFW 36 0 
KTWRGRYYPLQATTMLIQPMAAEAAS 386 (SEQ ID NO: 55) 
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Figure 23. Nucleotide and Protein Sequences For CG57051-03. 

Nucleotide sequence encoding the Angiopoietin-like protein, CG57051-03. 



>CG57051-03 

CCCCGAGAGTCCCCGAATCCCCGCTCCCAGGCTACCT7y\GAGG ATQAGCGGTGCTCCGAC 6 0 

GGCCGGGGCAGCCCTGATGCTCTGCGCCGCCACCGCCGTGCTACTGAGCGCTCAGGGCGG 120 

ACCCGTGCAGTCCAAGTCGCCGCGCTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGCA 180 

CGGACTCCTGCAGCTCGGCCAGGGGCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCT 24 0 

GAGCGCGCTGGAGCGGCGCCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTC 3 00 

CACCGACCTCCCGTTAGCCCCTGAGAGCCGGGTGG ACCCTGAGGTCCTTCACAGCCTGCA 360 

GACACAACTCAAGGCTCAGAACAGCAGGATCCAGCAACTCTTCCACAAGGTGGCCCAGCA 420 

GCAGCGGCACCTGGAGAAGCAGCACCTGCGAATTCAGCATCTGCAAAGCCAGTTTGGCCT 480 

CCTGGACC ACAAGCACCTAGACCATGAGGTGGCCAAGCCTGCCCGAAGAAAGAGGCTGCC 54 0 

CGAGATGGCCCAGCCAGTTGACCCGGCTCACAATGTCAGCCGCCTGCACCATGGAGGCTG 600 

GACAGTAATTCAGAGGCGCCACGATGGCTCAGTGGACTTCAACCGGCCCTGGGAAGCCTA 660 

CAAGGCGGGGTTTGGGGATCCCCACGGCGAGTTCTGGCTGGGTCTGGAGAAGGTCCATAG 720 

CATCACGGGGGACCGCAACAGCCGCCTGGCCGTGCAGCTGCGGGACTGGGATGACAACGC 780 

CGAGTTGCTGCAGTTCTCCGTGCACCTGGGTGGCGAGGACACGGCCTATAGCCTGCAGCT 840 

CACTGCACCCGTGGCCGGCCAGCTGGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGT 900 

ACCCTTCCCCACTTGGGACCAGGATCACGACCTCCGCAGGGACAAGAACTGCGCCAAGAG 960 

CCTCTCTGGAGGCTGGTGGTTTGGCACCTGCAGCCATTCCAACCTCAACGGCCAGTACTT 1020 

CCGCTCCATCCCACAGCAGCGGCAGAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGCG 1080 

GGGCCGCTACTACCCGCTGCAGGCCACCACC ATGTTGATCCAGCCCATGGCAGC AGAGGC 1140 
AGCCTCCTAG 1150 (SEQ ID NO: 56) 



Protein sequence for CG57051-03. 



>CG57051-03 

MSGAPTAGAALMLCAATAVLLSAQGGPVQSKSPRFASWDEMNVLAHGLLQLGQGLREHAE 60 

RTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLF 120 

HKVAQQQRHLEKQHLRIQHLQSQFGLLDHKHLDHEVAKPARRKRLPEMAQPVDPAHNVSR 180 

LHHGGWTVIQRRHDGSVDFNRPWEAYKAGFGDPHGEFWLGLEKVHS I TGDRNSRLAVQLR 24 0 

DWDDNAEIiLQFSVHLGGEDTAYSLQLTAPVAGQLGATTVPPSGLSVPFPTWDQDHDLRRD 3 00 

KNCAKSLSGGWWFGTCSHSNLNGQYFRS I PQQRQKLKKGI FWKTWRGRYYPLQATTMLIQ 3 60 
PMAAEAAS 368 (SEQ ID NO: 57) 



